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Excision frequenc 
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IR TR 
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TR IR 



IR TR 

11 x ID' 1 



<^::::::: :: :::::::::n» 

125 bp (62 bp 
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Plasmids 



Insertion Sequencs 



pfAO-P/L-TTAA 
pfAO-P/L-TTAA2 
plAO-P/L 
pfAO-P/L-13 bp 
p(AO-P/L-22bp - 
pUO-P/L-40 bp X^5Qo 



[PTA Freq uen 



pfAO-P/L-73 bp 
plAO-P/L-212 bp 
pUO-P/L-354 bp 
pUO-P/L-5S9 6p 
pfAO-P/L-2.2 kb 



TTAA ~~ ~ • 

TT A ATT A A " 

^^tStSSS^^^^^ CCT ID ND:33)..V 

GtACTAGCTAQATTAA YsB^SST©* 0 !^^ 00 ^^.. 



j - 6 .x 10- 

2.J ;< I 0 ^ 

'.2.x 10 4 



.3.1 x ip J 
2.9 x ur 5 
j. 2 x I0 J 



^IR- TR inserts- TR tR* 



(iVrztUHpal) 




Apal 



{DraUHpal) 



Sequence Range: 1 to 7670 



100 

AACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTrCCTCGCTCAC 
TGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGC 



TCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAA 
GAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAG 
>Ori 



GCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCACAAAA 
ATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAA 

400 

GATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCrGTTCCGACCCT 
GCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGT 

500 

GGCGCITTCrCAATGCTCACGCrGTAGGTATCTeAGTTCGGTGTAGGTCGTrCGCT 
CCAAGCTGGGCTGTGTGCACGAACCCCGCGTTCAGCCCGACCGC 

600 

TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAAGCCGGTAAGACACGACTTAT 
CGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGT 

. 700 

ATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGGCTAACTACGGCTACACTAG 
AAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTT 



800 

CGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGG 
TGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGGA 



FIG. 2(C1) 



900 



TCTCAAGAAGATCCTTTGATCTTTTCTAC.GGGGTCTGACGCTCAGTGGAACGAAA 
ACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCT 



1000 

TCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATA 
TGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGC 

<W H K I L S A 
< AMP RESIST 



1100 

ACCTATCrCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCG 
TGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGT 

<G IEAIQRNREDMTAQSGTTYIVVIRSPKGD 

P G L 

< : : AMP 

RESIST 



1200 

GCrGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATA 
AACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTT 

<A AIIGRS GREGAGSKDAIFWGAPLASRLLP 
G A V K 

< : . amp 

RESIST 



1300 

TATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTC 
GC 

CAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCAT 

<DAEMWDILQQRSALTLLEGTLLKRLTTAM 
A V P M 

< — ■ AMP 

RESIST 



1400 



FIG, 2(C1) 



CGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGAT 
CAAGGCGAGTTACATGATCCCCCATGTTQTGCAAAAAAGCGGTT 

<TTDREDNPIAENLEPEWRDLRTVHDGMNH 
L F A T 

< AMP 

RESIST 



1500 

AGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCAC 
TCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCAT 

<LEKPGGITTLLLNAATNDSMTIAASCLERV 
T M G D 

< . : amp 

RESIST 



1600 

CCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATA 

GTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGA 

<T L H K E T V P S Y E V L D N Q S Y H I R R G L Q E Q G A 
DIRS 

< — : amp 

RESIST. 



1700 

TAATACCGCGCCACATAGCAGAACrTTAAAAGTGCTCATCATTGGAAAACGTTCr 
TCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGT 

<LVAGCLLVKFTSMMPFREEPRFSELIKGSN 

L D L 

< , : _AMP 

RESIST 



1800 

TCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAG 
CGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAA 

<EIYGVRAGLQDEADKVKVLTEPHAFVPLCF 
A A F F 



FIG. 2(C1) 



< — AMP 

RESIST. 



1900 

AGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTCAATA 
TTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACAT 
<PILAVRFHQISM CSEQ ID NI: 58.) 
<_ AMP RESIST 



2000 

ATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGA 
AAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTA 

2100 

ACCTATAAAAATAGGCGTATCACGGGGCCCTGAGGTGAACGAATTGTCACACGT 
AATATTACGACAACTACCGTGCACAGGCTTTGATAACTCCTTCACG 

<RYFYAY.*PARLHVLQ*VYY*SL*RACAKIV 

G E R 

<- : ORF1 N-TERM 

[SPLIT] . 



2200 

TAGTATTCACCGAGTGGTACTCCGTTGGTCTGTGTTCCTCTTCCCAAATAAGGCAT 
TC 

CATTTATCATATACTTCGTACCACTGTCACACATCATGAGGA 

<LI*RTTSRQDTN R K GFLANWKDYVEYWO* 
V D H P 

< ORF1 N-TERM 

[SPUT] 



' " 2300 

TTTTTATTCCATACTTACTTGGCTTGTTTGGGATATACATCCTAAACGGACACCGT 

CCTCTAAAACCAAGTAACTGTTCATCTATGGTCAAATGAGCCCC 

<N KNWV*KAQKPYVD*VSVTR*FWTVT*RH 
D F S G R ' 

< : ORFl N-TERM 

rSPLTTI 



FIG. 2(C1) ,. . 



I 



2400 

TGGAGTGTAATTTTGTATGCACTGATGGATAAAGAGATCCCATATTTTTCTAACA 
GGAGTAAATACATCGTTTTCTCGAAGTGTGGGCCGTATACTTTTG 

<SHLKTHVSPYLS G M N K * CS YICRKRSTHA 
T Y K Q 

< ORF1 N-TERM 

[SPLIT] 

2500 

TCATCCATTCTAAGACATCGTATCAAAAAATCCAAAACGATCCACAGACTCATTA 
CAGAGACGTACACATTGACAAAGATCGATCCAAAGAGGTCATCTG 

<*GN*SMTDFFGFRDVSENCLRVCQCLDIWL 

P * R 

< — . I ORF1 N-TERM 

[SPL1T]_ . 



2600 

tggacaTgtggttatcttttctcactgctgtcattaccagaataccaaagaaagc 

ATAGATTTCATCTTCATTCGTGTCACGAAATGTAGCACCTGTCAT 

<H vhp*rkessdng'sywlfcln*r*eh*siy 

C R D Y 

< ORF1 N-TERM 

rspuri 



2700 

AGATTCCCGAGGTTTCAATGATATCTCAGCATTTGTCCATTTTACAATTTGCGAAA 
TTATCTCATCAGTAAAAAATAGTTTGAAGCATAAAAGTGGGTCA 

<I G S T E I I D * CK.D MKCNAFND* * YFITOLMF 

TP* 

< : : ORF1 N-TERM 

[SPLIT] - - 



2800 



TATATATTGCGGCACATACGCGTCGGACCTCTTTGAGATCTGACAATGTTCAGTG 
CAGAGACTCGGCTACCGCTCGTGGACTTTGAAGTTAAATTCAGAT 

<IYQPVYADSRKSIQCHETCLSP*REHVKF N (SB Q id m 

^D: 59)" 



FIG. 2(C1) 



< ORFl N-TERM 

[SPLIT] ■ 



2900 

ATAAAGACGCTGA.^AATCATTTGATTTTCGCTCTAACATACCACCCTAAAGATTA 
TAAATITAATGAATTATTAAAATACGTACAACAATTGTCTGTAAA 



3000 

TCAACAACGCACAGAATCTAGCGCTTAATAAATGTACTAATAACAATGTATCGTG 
TTTTAATACGCCGGACCAGtGAACAGAGGTGCGTCTGGTGCAAAC 

3100 

TCCTTTACTTTGAACACCAGGGAAACTTCAAGGAGAATTTCCTCCTCTTCAGCAG 
AGTCGGTACCGGTCACCCGGGGATCCCCCCTGCCCGGTTATTATT 



3200 

ATTTTTGACACCAGACCAACTGGTAATGGTAGCGACCGGCGCTCAGCTGGAATTC 
CGCCGATACTGACGGGCTCCAGGAGTCGTCGCCACCAATCCCCAT 

<K Q C WVLQYHYRGASLQFEAS VSPSWSDDG 
G I G M 



LACZ 



3300 

atggaaaccgtcgatattcagccatgtgccttcttccgcgtgcagcagatggcga 
tggctggtttccatcagttgctgttgactgtagcggctgatgttg 

<H fgdinlwtgeeahllhrhstemlqqqsy 

R S I N 



.LACZ. 



3400 

aactggaagtcgccgcgccactggtgtgggccat^ttca.<vttcgcgcgtcccgc 
agcgcagaccgttttcgctcgggaagacgtacggggtatacatgt 



FIG. 2(C1) 



<FQFDGR WQHPG 
P T Y M D 



Y N L E RTGCRLGNES PFVY 



LACZ 



3500 

ctgacaatggcagatcccagcggtcaaaacaggcggcagtaaggcggtcgggat" 
agttttcttgcggccctaatccgagccagtttacgcgctctgctac 

<SLPLDWRDFCAATLRDPYNEQPGLGLWNV 
R E A V 



_LACZ 



3600 

CTGCGCCAGCTGGCAGTTCAGGCCAATCCGCGCCGGATGCGGTGTATCGCTCGCC 
ACTTCAACATCAACGGTAATCGCCATTTGACCACTACCATCAATC 

<Q ALQCNLGIRAPHPTDSAVEVDVTIAMOG 
S G D I V 



LACZ 



3700 

CGGTAGGTTTTCCGGCTGATAAATAAGGTTTTCCCCTGATGCTGCCACGCGTGAG 
CGGTCGTAATCAGCACCGCATCAGCAAGTGTATCTGCCGTGCACT 

<R YTKRS IFLTKGQHQWAH. ATTILVADALTD 
A T C Q 

< LACZ 



3800 

GCAACAACGCTGCTTCGGCCTGGTAATGGCCCGCCGCCTTCCAGCGTTCGACCCA 
GGCGTTAGGGTCAATGCGGGTCGCTTCACTTACGCCAATGTCGTT 

<L L A A E A Q Y H G A A K W R E V W A N P D I R T A E S 
V G I D N 



FIG. 2(C1) 



3900 



ATCCAGCGGTGCACGGGTGAACTGATCGCGCAGGGGCGTCAGCAGTTGTTTTTTA 
TCGCCAATCCACATCTGtGAAAGAAAGCCTGACTGGCGGTTAAAT 

<D L P A R T F Q DRLPTLLQKKDGIWMQSLFGS 
Q R N F 



_LACZ 



4000 

TGCCAACGCTTATTACCCAGCTCGATGCAAAAATCCATTTCGCTGGTGGTCAGAT 

GCGGGATGGCGTGGGACGCGGCGGGGAGCGTCACACTGAGGTTTT 

<Q WRKNGLEICFDMESTTLHPIAHSAAPLTV 
S L N E 

< : ; LACZ • 



4100 

CCGCCAGACGCCACTGCTGCCAGGCGCTGATGTGCCCGGCTTCTGACCATGCGGT 
GGeGTTCGGTTGCACTACGCGTACTGTGAGCCAGAGTTGCCCGGC 

<A LRWQQWASIHGAESWATANPQVVRVTL 
W L Q G A 



_LACZ_ 



4200 

GCrcrCCGGCTGCGGTAGTTCAGGCAGTTCAATCAACTGTTTACCTTGTGGAGCG 
ACATCCAGAGGCACTTCACCGCTTGCCAGCGGCTTACCATCCAGC 

<SEPQPLEPLEILQKGQPAVDLPVEGSALPK 

G D L 



FIG. 2(C1) 



4300 



GCCACCATCCAGTGCAGGAGCTCGTTATCGCTATGACGGAACAGGTATTCGCTGG 

TCACTTCGATGGTTTGCCCGGATAAACGGAACTGGAAAAACTGCT 

<AVMWHLLENDS HRFLYESTVEITQGS LRFO 
FFQQ 



_LACZ 



4400 

GCTGGTGTTTTGCTTCCGTCAGCGCTGGATGCGGCGTGCGGTCGGCAAAGACCAG 
ACCGTTCATACAGAACTGGCGATCGTTCGGCGTATCGCCAAAATC 

<Q HKAETLAPHP-TR. DAFVLGNMCFQRDNPT 
D G F D 



LACZ 



4500 



ACCGCCGTAAGCCGACCACGGGTTGCCGTTTTCATCATATTTAATCAGCGACTGA 
TCCACCCAGTCCCAGACGAAGCCGCCCTGTAAACGGGGATACTGA 

<GGYASWPNGNEDYKILSQDVWDWVFGGO 
L R P Y Q 



LACZ 



• - 4600 

CGAAACGCCTGCCAGTATTTAGCGAAACCGCCAAGACTGTTACCCATCGCGTGG 
GCGTATTCGCAAAGGATCAGCGGGCGCGTCTCTCCAGGTAGCGAAA 

<RFAQWYKAFGGLSNGMAHAYECLILPRTE 
G P L S L 



FIG, 2(C1) 



m 



JLACZ 



4700 

GCCATTTTTTGATGGACCATTTCGGCACAGCCGGGAAGGGCTGGTCTTCATCCAC 
GCGCGCGTACATCGGGCAAATAATATCGGTGGCCGTGGTGTCGGC 

<W KKISWKPVAPFPQDEDVRAYMPCIIDTA 
T T D A 



.LACZ 



4800 

TCCGCCGCCTTCATACTGCACCGGGCGGGAAGGATCGACAGATTTGATCCAGCG 
ATACAGCGCGTCGTGATTAGCGCCGTGGCCTGATTCATTCCCCAGC 

<G GGEYQVPRSPDVSKIWRYLADHNAGHGS 
ENGL 



.LACZ 



4900 

GACCAGATGATCACACTCGGGTGATTACGATCGCGCTGCACCATTCGCGTTACGC 
GTTCGCTCATGGCCGGTAGCCAGCGCGGATCATCGGTCAGACGAT 

<SWIIVSPHNRDRQVMRTVRESMAPLWRPD 
D T L R N 



LACZ 



- - 5000 

TCATTGGCACCATGCCGTGGGTTTCAATATTGGCTTCATCCACCACATACAGGCC 

GTAGCGGTCGCACAGCGTGTACCACAGCGGATGGTTCGGATAATG 

<M PVMGHTEINAEDVVYLGYRDCLTYWLP 
H N P Y H 



FIG. 2(C1) 



< 



LACZ. 



5100 

CGAACAGCGCACGGCGTTAAAGTTGTTCTGCTTCATCAGCAGGATA.TCCTGCACC 
ATCGTCTGCTCATCCATGACCTGACCATGCAGAGGATGATGCTCG 

<SCRVANFNNQKMLLIDQVMTQEDMVOGHL 
P H H E 



_LACZ 



5200 

TGACGGTTAACGCCTCGAATCAGCAACGGCTTGCCGTTCAGCAGCAGCAGACCA 
TTTTCAATCCGCACCTCGCGGAAACCGACATCGCAGGCTTCTGCTT 

<H RNVGRILLPKGNLLLLGNEIRVERFGVDC 
A E A E 



_LACZ 



5300 

CAATCAGCGTGCCGTCGGCGGTGTGCAGTTCAACCACCGCACGATAGAGATTCG 
GGATTTCGGCGCTCCACAGTTTCGGGTTTTCGACGTTCAGACGTAG 

<I.L TGDATHLEVVARYLNPIEASWLKPNEV 

N L R L . • 

< . LACZ 



• - 5400 

TGTGACGCGATCGGCATAACCACCACGCTCATCGATAATTTCACCGCCGAAAGG 

CGCGGTGCCGCTGGCGACCTGCGTTTCACCCTGCCATAAAGAAACT 

<TVRDAYGGREDIIEGGFPATGS AVQTEGO 
W L S V ^ 



FIG. 2.CC1) 



5500 

GTTACCCGTAGGTAGTCACGCAACTCGCCGCACATCTGAACTTCAGCCTCCAGTA 
CAGCGCGGCTGAAATCATCATTAAAGCGAGTGGCAACATGGAAAT 

<TVRLYDRLEGCMQVEAELVARSFDDNFRT 
A V H F D 



LACZ_ 



5600 

CGCTGATTTGTGTAGTCGGTTTATGCAGCAACGAGACGTCACGGAAAATGCCGCT 
CATCCGCCACATATCCTGATCTTCCAGATAACTGCCGTGACTCCA 

<SIQTTPKHLLS VDRFIGSMRWMDQDEL Y S 
G D S W 



_LACZ 



5700 



ACGCAGCACCATCACCGCGAGGCGGTTTTCTCCGGCGCGTAAAAATGCGCTCAG 

GTCAAATTCAGACGGCAAACGACTGTCCTGGCCGTAACCGACCCAG 

<R LVMVALRNEGARLFASLDFESPLRSDOG 
Y G V W 



_LACZ 



5800 



CGCCCGTTGCACCACAGATGA.AACGCCGAGTTAACGCCATCAAAAATAATTCGC 

GTCTGGCCTTCCTGTAGCCAGCTTTCATCAACATTAAATGTGAGCG 

<RGNCWLHFASNVGDFIIRTQGEQLWSEDV 
N F T L S 



FIG. 2(C1) 



) 



.LACZ. 



5900 

AGTAACAACCCGTCGGATTCTCCGTGGGAACAAACGGCGGATTGACCGTAATGG 
GATAGGTTACGTTGGTGTAGATGGGCGCATCGTAACCGTGCATCTG 

<Y CGTPNETPVFPPNVTIPYTVNTYIPADYG 

H M Q 



LACZ 



6000 

CCAGTTTGAGGGGACGACGACGGGATCCGTTTTTTTATTACAAAACTGTTACGAA 
AACAGTAAAATACTTATTTATTCGGACCAACAATGTTTATTCTTA 

<V L L T * E * 

< ORFl N-TERM [S 

<WNSfVVVPDTKOCFQ'SFLLISI*ESWC(S?QiD 

< LACZ__ 



6100 

CCTCTAATAGTCCTCTGTGGCAAGGTCAAGATTCTGTTAGAAGCCAATGAAGAAC 
CTGGTTGTTCAATAACATTTTGTTCGTCTAATATTTCACTACGCT 

. <R *YDETALDLNQ*FGIFFRTT*YCKTRRIN 
* * A 

< : ORFl N-TERM 

rSPLOT 



• - 6200 

TGACGTTGGCTGACACTTCATGTACCrCATCTATAAACGCTTCTTCTGTATCGCTC 
TGGACGTCTTCACTTACGTGATCTGATATTTCACTGTCAGAATC 

<Q RQSVS *TG*RYVSRRYREPRR*KRSRIN* 
Q*FG 

<- ORFl N-TERM 

[SPLIT] 



FIG. 2(C1) 



6300 



CTCACCAACAAGCTCGTCATCGCCTTGCAGAAGAGCAGAGAGGATATGCTCATC 
GTCTAAAGAACATCCCATTTTATTATATATTAGTCACGATATCTAT 

<* W C A R * R R A S S C L P Y A * R R F F M G X (SEQ ID NO- 61 
<_ ORF1N-TERM 

rspuri 

6400 

AACAAGAAAATATATATATAATAAGTTATCACGTAAGTAGAACATGAAATAACA 
ATATTAATTATCGTATGAGTTAAATCTTAAAAGTCACGTAAAAGAT 

6500 

AATCATGCGTCATTTTGACTCACGCGGTCGTTATAGTTCAAAATCAGTGACACTT 
ACCGCATTGACAAGCACGCCTCAGCCGAGCTCCAAGCGGCGACTG 

6600 

AGATGTCCTAAATTGCAAACAGCGACGGATTCGCGCTATTTAGAAAGAGAGAGC 
AATATTTCAAGAATGCATGCGTCAATTTTACGCAGACTATCTTTCT 

RIGHT TERMINAL 

REPEAT > 

6700 

AGGGTTAATCTAGAGGATCCTCTAGATTAACCCTAGAAAGATAATCATATTGTGA 
CGTACGTTAAAGATAATCATGCGTAAAATTGACGCATGTGTTTTT 



. LEFT TERMINAL 

repeat; > • 

6800 

atcggtctgtatatcgaggtttatttattaatttgaatagatattaagttttatta 

TA 

TTTACACTTACATACTAATAATAAATTCAACAAACAATTTAT 

6900 

TTATGTTTATTTATTTATTAAAAAAAAACAAAAACTCAAAATTTCTTCT 
ACAAAACTTTTAAACATTCTCTCITTTACAAAAATA,A.ACri^ 



FIG. 2(C1) 



7000 



TGTACTTTAAAAACAGTCATGTTGTATTATAAAATAAGTAATTAGCTTAACTTAT 
ACATAATAGAAACAAATTATACTTATTAGTCAGTCCAGAAACAAC 

<D T W F C S 
< ORP1 C-TER 



7100 

TTTGGCACATATCAATATTATGCTCTCGACAAATAACTTTTTTGCATTTT^ 
GATGCATTTGCCTTTCGCCTTATTTTAGAGGGGCAGTAAGTACA 

<Q CMDINHERCIVKKCKKCSANAKRRIKSP 
C Y T C 

< : : : ORP1 

C-TERM ■ • 



7200 

GTAAGTACGTTTTTTCATTACTGGCTCTTCAGTACTGTCATCTGATGTACCAGGCA 
CTTCATTTGGCAAAATATTAGAGATATTATCGCGCAAATATCTC 

<Y TRKKMVPEETS DDSTGPVENPLINS INDR 

L Y R 

< . : ORF1 

C-TERM 



7300 

TTCAA AGT AGGAGCTTCTAAACGGTTACGCATAAACGATGACGTCAGGCTCATGT 
AAAGGTTTCTCATAAATTTTTTGCGACTTTGAACCTTTTC 

<K LTPAELRNRMFS STLSMYLN R M F K K R S Q 
V K E. G K 

< ORF1 

C-TERM 



- - 7400 

TGCTACTGACATTATGGCTGTATATAATAAAAGAATTTATGCAGGCAATGTTTAT 
CATTCCGTACAATAATGCCATAGGCCACCTATTCGTCTTCCTACT 

<SSVNHSYIIFSNICAINIMGYLLAMPWRNT 

K R S 

< ; ORE I 

C-TERM 



FIG. 2(C1) 



7500 



GCAGGTCATCACAGAACACATTTGGTCTAGCGTGTCCACTCCGCCTTTAGTTTGA 
TTATAATACATAACCATTTGCGGTTTACCGGTACTTTCGTTGATA 

<C TMVSCMQDLTDVGGKTQNYYMVMQPKG 
T S E N I 

< ORF1 

C-TERM 



7600 

GAAGCATCCTCATCACAAGATGATAATAAGTATACCATCTTAGCTGGCTTCGGTT 
TATATGAGACGAGAGTAAGGGGTCCGTCAAAACAAAACATCGATG 

<S ADEDCS SLLYVMKAPKPKYSVLTLPGDFC 
F M ST ' 

< . : ORF1 

C-TERM 



TTCCCACrGGCCrGGAGCGACTGTTTTTCAGTACTTCCGGTATCTCGCGTTTGTTT 
GATCGCACGGTACC (SEQ ID NO: 57) 

<G V P R S R S N K L V E P I E R K N S R V T G (SBQ ID NO: 62) 

< ORP1 

C-TERM 



FIG. 2(C1) 



FlCr. <3Ct3) 



?IA0-?/L-L«nbda-2 . Z'nb 

Sequence Range: I Co 599 q 



^CTCAAA^CC^AATAC^ 
I 

f 

GCCGCGTTGcrGGCGTTTTTCCAI^.GGCTCCGCCCCCCrGACGa^^ 



40O 

TCTCCCTTCGGGAAGCGT 
GCCGCTTTCTCAATGCTCAC^ 

TGCGCCTXAICCGG^CTATCGn^ 

^GTAGGCOTGCTACAi_*^^ 

C«^A*AGAGTTGGTA^ 

TCTCAAGAAX^TCCTTTCMCTTTXCT^ 

TCACCTA<_X_CT^ 

<W H K r - L S A 
< AMP RESIST 



ACCTATCTCAG^GATCTGTCTAXTTCGTTT^^ 

<G rEAXQRNRSD M TAQSGTTYIVV I. RSP jTT^ Tff 
< _AH? RESIST 

GCTGCAATSAXACCGCG^^ 

<AAZIGRSG R EGAG5KDAIF WS APiA3Rt.l.p G iTv k 
< — AMI? RESIST ' 

<DA2MWD i r.QQaSALTl,LSSTLLICR_TTAMAVPM 



AMP RE3X3T 



CGTGCTCTC-ACGCZCGTCGTT^^ 

<TT D RBD tf P IAS_*_25 8WRD_STV_D6M_-__FA¥ 
< _ AMP RESIST ' 

AGCTCCTTCGGTCCTCCGAT C GTT G TCAG AAGTAAGTT GGC CGO\GTGTT ATCACTCATGCTTAT G<^CAGCAC T GCATAATT C T C T TAC TGT CATi 1 ^ ° 



<L 2 K. P G G I X T L L L N A A T N D S M T "V A A V C lT^^V 
C AtCP RESIST 



CCGTAAiGATGCTTTTCTGTGACTGGTGAGTACTCA^ 

<rZ.a_-2TVPSYSVr,DMQ3Y Hr=lRGI.QSQGADlR 3 
< AMP RESIST 

1700 

^ AAIAC C GC ^ACAXAGCASAACT TT3J^AAI^TGC TCAT CATTGGAAAAC STXCTTCGSGXSCGAAAACT^TCAAGGAT C TTAC C GCTGTT. GAGATC CAGT 
^VAGCLLVKFTSMMPFRSEPRFSSLIKGSHliOI,- 
<r A M? KJ53X3T_ 

J9<MGTJ_ACCCACTCGXG^ 

^S,---! Y G V R A G L .QV-0. 3- A- D K V Iv V 2- -B H A ? V ? £, _i^r ..a. A F F 

< - AMP RESIST 



1900 

AGSGAATAAGGGCGACACGGAAAI^TTGAATACTC^^ GAAGCATTTATCAGGGTI^^ r 

<eiLAv RS * H QrsM (SEQ ID NO: 58) 

< -AMP RSSIST_ 

2QO0 

A:TCTGAATGTATTTJU2AAAAAT>J^C^ GCCACCTGACGTCTAAGAAACCATT^J^ 



ACClMAAAAATAGGCGTATCACGeGGCCCTGAGG^ 
<R Y F Y A Y * P A R L HVLQ*VjfY* S L * R A C ■ A K I** V G 2 R 
c ORF1 N— TERM fSPI.IT] 



<Lr*RTTSRQDTlXRiCGFLAirWKDYVKYWQ* V D H D 

< ORF1 N-TERM [SPLIT] 

TTTTrAXTCCATACTTACTTGGCTTC^^ 
<HR NWV*5CAQKPYVD*VSVTR* FVTVT'RHD F S G R 
<_ N-TKKM [SPLIT] - 

2<JOO 

<S HLKT3VS P YL S G K IT K * C 3 Y I C RKRS T H A T Y K 

< ORF1 N-TERH [SPLIT] 

2SQO 

TCMCCATTCTAAGACATCGrATCAAAAAATCO^AAACG 

<* GN'SMTD FFGFRDV SE HCL RVCQCLD IWL P » ~R 
< ; ; ORF1 K— TERM [SPLIT] , 

2600 

T GG^CATGTGGTTATCl 111 CT<^CTCOT^T1ZAXTACCAGAAXAC CAAAGAAAGCAT AGATTTCATCTT CATTC GTGTC^GAAATGOrAGCAOZTGTi^AT 
<HVHP* RKSS SDNGSYWLFCLN*R*KH*SIYCRD Y 
< . _ORFl fci— TERM [SPLIT] " 

2700 

AGATTCCCGACGTTTCAXT^XATCT^^ 

<IGSTE.IID 4 -CKD»KCMAFHD* * Y PITQLttFTP * 
< 0RF1 fif— TERM [SPLIT] . 

2800 

TATATATTGCGGCACATACGCGTCGG^CTTCTTTSa^ 

<IYQPVYAD5 RKS IQCH2TCLS P»R2RVKPn( SEQ ID 
< ORSa S-TERW [SPLIT) ^ j 

2900 

ATAAAGACGCTGAAAATCATTTGAX^TCGCTXTI^^ ATTGTCTGTAAA 

3000 

TC^\ACAACGCACAGAJtXCTAGCGCTTAATJ^^ 

3100 

TCC^T^ACT*i7L'GAACACCAGGGAAACTT C^\AGGAGAA!PTTC CTCCTCTTCAGCACAGTXIGGT2*CC GGTCACCCGGGGAXCCCCCCTGCCCGGTTAITTATT 

' _ 3200 

ATTTTTGAlIAnCAGACCAACTGGTAAT 

<K Q ' C W V L Q Yfl Y R G A 3 LQ ? 5 A 3 V S P S OTS DDGGIGM 
< LACE 

330 0 

ATGGAAACCGTCGA2I=*TTCAGCCATGTGCCTT^ 

<3 FGDIM.LNffTGUEAELLHRES T 5H- LQQQSYRSI M 
< LACS 

3400 

AJ^TGGAAJ^CGCCGCGCCACTGG^GTGGGCCATAA^ 
<:F Q F DGRWQH PGYNLBRTGCRLGK33 ? F V Y ? T T M D 
< LAC 2 

3500 

CTGACAA^GGCAGAJHCC^l^GCGGT^ 

<2 L ? L DCTR D ? C A A T L R □ PYNEQPGLGLWtfVRSA V 
< LACZ . 



. LAjCZ * c Q 



G<^C^JVCGCTGCTTCG^CT(^XXATGGCCCGCCGCCrTCC^ *8O0 

A K 

LACS 



— T.a.r-T - n e 



< L ^^ vw - i * v -l-IjWLQG.2V 
. T-jr7 <o- w 



EGSALP KGDli 
,LAC2 



C IAJC2 3 - Q c F Q Q 

: : . .lacz_ 



*<»^CGTMGCCGACCM ir -_^_^__ , „ _ _ 4S °° 

E, 3 
LAC 2 



r.^^-»r ~ - Q 



CC^VACGCCTGCCAOTJ^riA^ 

<K,AQ^ Y KAFGGI.S>XGMAHAYSC i ZLPRr E G p~ 
— LACS 



rv^, 3WK pvAP = PQ DSDVRA ^ MpcriDTATT 



LAC2 



LACS 



aoo 

CAGC 
L 



— LACo " J ' * 

4 - " ** -A'-s- 3 . D V V V L G 1' 3,_:j3. C.L T Y W L ^ H"">L_^Y q; 

„ LACZ — 



\ 

J 



< T.-\(-<7 - * *J 



^GGTTAACGCCT^^ 
< LACZ 



< <ItTGDXTHLB V V A .RrLWPIZA$ V2 .xeiiE- V M ^ 



T ^CGCGATCGGCATAACC^ 

<TVRDAYGGRSDriEGGFPATGSAVQTEGQ xTTV^ 

LAC 2 



<SlQTTPKHLLSVDRFrGSMRWMDQ.DBX.YSG D S W 



C : UVC3 



ACGCAGCACCATCACCGCGAG^GGTTTTC 

LACZ 



^RLVKVALRKZGARLFASLDFESPLRSDQGYGV^T 



CGCCCGTTG^CCA^ 

<RGMCWLHFASNVGD2-XIRTQGSQI,WSEDVNFr 



iVGTSUUTAACCCGTCGGAX^ 

<Y C G T PNETPV P P. P N V T t P Y T V "if* T • r I PAD Y G H M o 
< - LACS 

6O0O 

•XACTTATTrATTCGGACCAACAAI GTTTA.TTCTTA. 

<V L L T * E * 




< ORFl N-TSRM fs 

svc ( SEQ IETTTO 
NO: 60) 



\TAGTCCTCTCvrGGCAAGGTCAAGATTCTGTTAG^ 

YD STALDLMQ* F G I F P R T "* T * r C KTRRI if - * A 
_ ■ ORFl N-TSRM L SPLIT I 



- < ^'^ACTTCAT GTACC TCATCTATAAACGC T TCT T C7GTATC GCT CTGGACGTCTT CAC TTACGTGATCTGATATTT CACTG T CAGAA!F C 
- ORFl N-TSRM (SPLIT] ■ 



CTCJVCC^ZAACHZTCGTCATCGCCTTG^ 
< vr C A R * RR.\S S CL P Y A * R5.F?S<GX* e ™ NT ^ - _ , 

< „0R?1 M-T3RM C SPLIT] (SEQ ID NO I 61) 

AAC^AC^AAATAIA^T^^ 

RIGHT TERMINAL REPEAT > 



<XRI*G«D?*WSSLtfYA*SPTDQRi,YIVSR 
< EA31 (235); CODON_START-1,- DB_XRZE=PID:G215131; TRA [SPLIT] 



< MRNA-PL (ALT.; VIA T ' J 4 TS.1MZXA.TOR) (SPLIT J 

c KKNA-PL {ALT.; VIA T 1 o3 T5RMXNATOR) (SPLITl" 

<: : MRNA-PL (ALT.; VIA T'J2 TERMINATOR) (SPLIT] " 

< MRNA-PL (ALT - ; VIA T'Jl TERMINATOR) (SPLIT] ~ 



saoo 



T GTCGAGCT TAATAT T C TAT ATT GTTGTTC TTTCT GCAAAAA\GT GGGGAAGT GAGTL^AT GAAATXATT^ CTAACAT ^T AXC" 7 ^ CATCAXAC CTT CO^AG- 

<D L K I W - rXTR3APrHPLSYH?*K*CCDADYRG L 
< SA3 1 (296); CODON_START= 1 ; DB_XREF=PID : G2 X5 13 1 .; TRA [SPLIT ] 



_MRNA-PL (ALT.; VIA T'Sa TERMXMATOR) (SPLIT J 

_MRNA-?L (ALT.; VTA T'J3 TERMINATOR) (SPLIT]" 

_MRNA- PL (ALT.; VIA T ' X2 TBRMIMATOR) (SPLIT]' 

_WRNA-PL (ALT.; VIA T'Jl TEHKIKATOR) [SPLIT] 



CATTTATTAAGC^7IT?CGOTATAA^ 

«!^^.m J 7cLL (sto - id - nc>! " ,64) A 

<S YTRAPL?LKXHT3~S»RQ .SNDD S K F R NT 

< (525)/ CODON_START-1; DB_XR3F^PID : G215132 ; TRA 

< : ; MRNA-PE. {ALT.; VIA T'T-l TERMINATOR) [SPLIT] ^ — 

< ' MRNA-PL (ALT.; VTA T r 0T3 TERMINATOR) (SPLIT] : 



_MRNA-PL (ALT.; VIA T ' J2 TERMINATOR) (SPLIT ]~ 
_MRNA-PL (ALT.; VIA T'Jl TERMINATOR) [SPLIT]_ 



TCGACCTTCTAJ^CCTATCTG^ 

<SRKIRDSW + LKKS HM*LFARF*RVAIir.PPT nr * 



V A 

< EA5 9 (525)7 CODON_START~l ; DB_jXREE=P ID : G2 15 L3 2 : TKA 

< MRNA-PL (ALT.; VIA T' «T4 TERMINATOR) (SPLIT] ~ 



< 



_MRNA— PL (ALT - ; VIA T 1 J*3 TERMINATOR) [SPLIT] 
MRNA-PL (ALT.; VIA T ' J2 TERMINATOR) (SPLIT] * 



_MRNA-PI (ALT.; VTA T'Jl TERMINATOR) ( SPLIT] 



TTTGTCACTTCAAGTAAAAACACCTOVCGAG 

<KOS - TFVG-SirrCRL2-RID*-YGSPYYKN S~~R e" v 

SA59 (525) ; CODON_START=l : D3_XRBF=P ID ; G2 15132 ; TRA 

MRNA-PL (ALT.; VIA T ' J 4 TERMINATOR) (SPLIT] " 



< MRNA-PL (ALT.; VTA T * J~3 TERMINATOR) (SPLIT] 

< . MRNA-PL (ALT . ; VIA T 1 J2 TERMINATOR) [SPLIT] 

< . — MRNA-PL (ALT.; VTA T ' JI TERMINATOR) (SPLIT] 



GTAC^CTTTCCACATGKZA*^ 

<TPSBVHLISSGRATS3?F.3NCNNSCWQTCRTLR^ 



_RA5 9 ( 525) ; CODON_START-l ; DB_XREF*PID : G2 15132 ; TRA_ 
_MRNA-PL (ALT.; VIA T'J4 TERMINATOR) (SPLIT) 



_MRNA-PL (ALT.; VTA T 1 J 3 TERMINATOR) [SPLIT]" 
_MRNA-PL (ALT.; VIA T ' T2 TERMINATOR) (SPLIT]' 
_MRKA-?1 (ALT . ; VXA T'Jl' TERMINATOR) ( S PLIT } " 



730O 



T AAXGTTC GTAAAAAAGC1A.GAGAGCAAAGOT GGAT GCAGATGAAC CTCT GGTTCAT CGAATAAAACTAAT GACTTTTC GCCAAC GACATCTAC TAATC TT 
<XNTFFCLAFTSASSGRT*Slir3IVKRWRCRSl5 

< SA5 9 (525); CODON_£TART~ 1 ; DB_XREF° P ID : G2 IS 1 3 2 ; TRA 

c MRNA-PL (ALT.; VTA T ■ J" 4 TERMINATOR) (SPLIT] 

< _ . MRNA-PL (ALT./ VTA T ' «73 TERMINATOR) (SPLIT] ■ 

C MRNA-PL (ALT.; VLA T ' J2 TERMINATOR) [5PLIT] ~ 

< — __MRNA-?L (ALT.; VIA T'Jl TERMINATOR) [SPLIT] 



7400 



^TGATAGTAVwTAAAACAAT^GCATGTCCLA^ 
<HYYIFCNCTWLSITSAS IZPYQ^LVI * HI ' - R G 5~" 

c — ZLAo3 (525); CQDON_5rART- 1 ; D3_XR2F-?ID :G2lSl32 ; TRA 

C _MRNA-PL (ALT.; VTA T ' J 4 TERMINATOR) (SPLIT] 

MRNA- PL (ALT . : VIA T * u 3 TERMINATOR) ( S PLIT ] - 



MRNA-?L_(ALT. ; VTA T ' J2 TERMINATOR) -__('SPLT7] 



< MRNA- ?L (ALT . ; VIA T * JI TERMINATOR.) " ( 3 PLIT ] 



< 
< 

< 
< 



GAATCT GTG<7T TCATTACST CTTAACTCTT CATAT TT AGAAAT GAGGCTGAT GAGTTCC^iXATTTGAAAAGTTT TCATCACTACTTXiG^' 7 ^^T^c~A T ' r- r- 

_EA59 (S2S); CODON^STAKT==1: DB JCREF=PrD : G21S132 ; TRA 

MRNA-PL (ALT. 7 VTA T T J4 TERMINATOR) (SPLIT] : 

MRNA-PL (ALT . ; VXA T T T3 TERMINATOR) ~~ 

KRNA-PL CALX-/ VXA T ' JT2 TEHiCJBIOR) 

M RMA-PL (ALT.; VIA T r Jl TERMINATOR) 



[SPLIT)_ 
[3PLIT]_ 
[SPLIT] _ 



TTCAAGCCAGAGTTGTC 

A L T T K 



<_ 
<_ 
< 



*RSE7LVY15Z>KXft£B5Ka 
EA59 (525); CODON_S TART* 1 DB_XREF*>PID : G215132 ; TRA 

_MRNA--?L (ALT . ; . VXA T ' J 4 TBRMINAT OR } [SPLIT] 



7600 

tTTTXAAAC TAT TGCX O 
I ?C T * Q q 



_MRNA-PL (ALT- ; VXA T ' J3 TERMINATOR) [SPLIT] ~ 
_MRNA-PL (ALT - ; VIA T f J2 TERMINAT OR ) [SPLIT]" 
Jr£RNA-PL (ALT.; VIA T r Jl TERMINATOR) [SPLIT]" 



BA5 9 (525); CQDQN_STA=C?«=1 ; DB_JCRBF=P ID : G2 15 13 2 ; TRA_ 

J3RNA-PL (ALT.; VIA T'J4 TERMXNATQR) ( SPLIT 1 

__MRNA-E>L (ALT.; VIA T'J3 TERMINATOR) [SPLIT] 

_MRNA-PL (ALT.; VIA T'J2 TERMINATOR) [SPLIT] _ 

_MRNA-?L (ALT.; VIA T * Jl TERMINATOR) [SPLIT] 



< 



AAACAAGCt^TCGAAAATATCCCTTTXSGGATTCTT^ 



<_ 
< 



KPNK:VRYT- i *X*LE'PI 
EAS 9 (525);. CODON_5TA5T= 1 ; DB_XR2F=P ID : G2 15 132 ; 

MRNA-PL (ALT.; VTA T'J4 THRMTNATOR) [SPLIT] 

MRNArPL (ALT - ; VXAT'J3 TERMINATOR) [SPLIT]" 

__MRNA-PL (ALT - ; VIA T T J2 TERMINATOR) [SPLIT]" 
M RNA-PL (ALT.; VIA T ' Jl TERMINATOR) [SPLIT]" 



Q K 



790O 

CTGGGTTGGTGATTGCACC 
* P Q H W C w 



< 

<_ 

<_ 
<_ 
< 



M-TJTVWEARYFCSHEHQAY 
_SA59 (525); CODOM_START-l; DB_XRE^=PlD : G2 15 13 2 ; TRA 

MRNA-PL (ALT. ; 

MRNA-PL (ALT. ; 

MRNA-PL (ALT.; 

MRNA-PL (ALT.; 



7900 
F T 



VIA T'J4 THRMTNATOR) 

VIA T'J3 TERMINATOR) 

VIA T»J2 TERMINATOR) 

VXA T' Jl TERMINATOR) 



C SPLIT] 
[SPLIT] ~ 
[SPLIT] ~ 
[SPLIT] ~ 



AXP^TTAAATCACTGAATCCG 
<Y N F + Q X R 



9000 

■TGACAA3^TCTGGC3VAACCA!TTTAACACACGTGCGAA 
* I PL P FRVI RAFWKVC TRVTw" 

CODON_START-l ; D3_XREF=PXD : G2 15 13 2 : TRA 

(ALT.; VXA T»J4 TERMINATOR) 
MKNA-PL (ALT. r VIA T* J3 TERMINATOR) 
MRNA-PL (ALT.; VTA T » J2 TERMINATOR) 
MBNA- PL (ALT.; VTA T * Jl TERMINATOR) 



S C K K * 
EA59 (525) : 
_ MRNA-PL 



[SPLIT] 
[SPLIT]" 

[SPLITS 
[SPLIT] 



310Q 

GAAT^CTGAAAGAGT^JZCCCTCTAAGTAA^ 

<3NRrSN-GR^TILH*PRK-*S*HRSI35AM ' 

< EA59 (525 ) CODON__START>-l; DB_XRSf^PIB : G2 15 132 ; TRA 

< MRNA-PL (ALT.; VXA T ' J 4 TERMINATOR) [SPLIT] 



E* G 



_MRNA-PL (ALT. ; VUL r'J3 TERMINATOR) [SPLIT]" 
_MRNA-PL (ALT.; VIA T ' J2 TERMINATOR) [SPLIT] " 
_MRNA-?L (ALT-; VIAT'Jl TERMINATOR) [SPLIT]" 



5200 

AGCTTTAAGAAGGTTATGTTTAAW^ 

<S * S ? * T * ?• W R K I QQS * CL L ?. L ? 2 V N » P^*" 3 

< ,: S AS 9 ( S 2 S ) ; CODON_3TART=l ; DE_XRSF=? ID : G2 IS 1 3 2 ; TRA 

< MRNA-PL (ALT. 

< . MRNA-PL (ALT. 



VIA T' J4 TERMINATOR) (SPLIT] 
VXA T'J3 TERMINATOR) [SPLIT] " 



_MRNA-?L (ALT.; VIA T * 12 T2RMIHATOR) [SPLIT] ~ 
_MRNA- PL (ALT . ; VXA T ' II TSRMItfATOE) ( S ?LIT 1 " 



CTiSAAXTTmATCTATT^T 

<OZX-RWXFX.aKSftRlCSEtC|-Vir~CZIl*K K . K * W 
.SA59 (52S) CODQN -.'START-l; DB XREF^PID : G 2 15 132 ; TRA 



=PID:G215132; TRA 

C MRNA-PL (ALT.; VTA X' «J4 TSHMI2CATOR) [SPLIT] 

< MRMA-PL (ALT . ; ^TA T ' J"3 TERMINATOR) (SPLIT] 

5 MRNA-PL (ALT.; VIA T ' 12 TERMINATOR) [SPLIT] 

< , M RNA-PL (ALT- ; VU T'Jl TERMINATOR) [SPLIT] 



3300 
GCTCAT 



C SA59 (525); CODON_START-l; DB_IREE -PID: G2 15 L3 2 ; TRA ( SEQ 



______ - I o 

_MKNA-PL (ALT . ; VIA T ' J" 4 TERMINATOR) [SPLIT] ~~ 'NO* 6S ") 

_MRNA-?L (ALT. i VIA T ' J~3 TERMINATOR) [SPLIT] '. ' 



< , MRNA-PL (ALT.; VIAT'J2 TKBHINAIOR) [SPLIT] 

< _MRNA-°L (ALT.; VTA T ' Jl TERMINATOR)" [SPLIT] ~ 



TCmCAATAACA-TTAAACT^ 

< T _ _ MRJOw-PL (ALT . ; VIA T"J4 TERMINATOR) [SPLIT] ' 



_MR.tA.-PL (ALT . ; VTA T f J3 TERMINATOR) [SPLIT] 
JMRNA-PL IALT _ ; VTA T r J2 TEHMINAIOH) [ SPLIT] " 
_MRNA-FL (ALT.; VTA ? ' Jl TERMINATOR) [SPLIT]" 



TCTCXATTACAIACGTTTTCC^ 

< MRNA-PL (ALT.; VIA T ' J 4 TERMINATOR) [ SPLIT ] 

c . MRNA-PL (ALT - ; VTA T»J3 TERMINATOR) ( SPLIT. ~ " 

< . , . KRNA-PL (ALT.; VTA T 1 T2 IBKMTMAT O R ) [SPLIT] 

< ! MRNA-PL (ALT.; VIA T'JZ TECTCCNAJOR) ( SPLIT] ~ 



TAACCCLACTCTTAXT-^ 

< . MRNA-PL (ALT.; VL\ ?'J4 TERMINATOR) [SPLIT] 

< _ ____MRNA-PL (ALT. ; VTA T * J3 TERMINATOR) [SPLTT) " — 

c . X RNA-PL (ALT - ; VIA T ' J2 TERMINATOR) [SPLXT] " ' 

< ; MRNA-PL (ALT.; VTA T ' JT TERMINATOR) [SPLIT] " 



CTGAA£AAr^GTTOIAAGTACTAAXAAGCCG 

< _ MRNA-PL (ALT.; VTA T ' J4 TERMINATOR) [SPLIT] 

< _ MRNA-PL (ALT.; VTA T * J3 TSHMXNATQR) [<"t.tt] " 

C — . MRNA-PL (ALT.; VTA T»T2 TERMINATOR) [ SPLIT] ~ ~ 

C MRNA-PL (ALT.; VIAT'Jl TERMINATOR) [SPLIT] " 

TTCACTACAGTTATGGCGG^^ 

C MRNA-PL CALT.; VTA T ' J* 4 TERMINATOR) [SPLIT] 



9900 

:tctta 



_ .MRNA-PL (ALT.; VTA T'J3 TERMINATOR) (SPLIT) 

< : MRNA-PL (ALT.; VIA T » J2 TERMINATOR) [SPLIT] 

< ___MRNA-PL (ALT.; VIA T ' Jl TERMINATOR) [SPLIT] ~ 



C T^SGTTATG<ZAGGTCGTAGTGGGTGGCACAC1AAAG CTAGATTAACCC TAGAAAGATAAI GATATTGTGACGTACGTTAAAGA^TAATCAT GCGTAAAATT r* 

< MRNA-PL (ALT.; VTA T * J4 TERM 

< M RNA- PL (ALT . ; VTA T ' J3 TERM 



< MBNA- PL (ALT.; VIA T ' J2 TER M. 

< M RNA-PL (ALT.; VTA T ' Jl TERM 

LZET TERMINAL REP3AT 



ACGCATGTGTTTTTATCGGTCTGTATA^ 



2200 

A^AAACAVrTrTAJTrTATGTTTATTTA^ 

_ 330 0 

-^^^ACTTATTTTGtACTTTAAAAAC^ 



Ho. £(ta) Conf. 



<T W F C 3 QCM D I NHE R'CIVKKCJCKC S A W A K R R I £ S 
< QRF1 C-TSRM 



9SOO 

< ORc 1 C-TERM 



CGCGCAAA3^cnX;in:CAAAGTA£^?vGC^ 
<KLy^:<LT?A2LRNRHFSSTr.SKiTLNRKrKK-R S Q 



ORF1 C-TRHM 



AACCTTTTCTCCCTTCHZrACTGAC^^ 

<V X B S K S SVM3S Y I IFSHICAINIMGT L LAKPWr 
< , ORF1 C-TEFW 

980 O 

TTCGTCTTCCTl^TGGAGGTC^ 

<NTKRSCrMV3CMQOLTDVGeKrQHYYMVMQPKG.T 
< QRF1 C-TSFM ; ^ 

9900 

XAJ3TrxTCGTTGATAtSAAj^V^^ 

<SEMISADKDC53LliYVMKAP2CPKYSVlTI.p© D F 

< : oRra C-THPM 



ACAAAACA.T C GAIGT X C CCACTGGC CTGGA.GCGA.C TtSTTTTT CAjGTACTTC CGGTA3XTTC GC GT T'TGTTT GAJTC GCACG<*TAlCC ( SEQ IDS NO * 63 ^ 

< orfi c-tbrm • ID NO: 66) 




FIG .3 



ITR Cartridge Sequence 



Sequence Range: I to 70 7 



SO 

G GAT C C CAT G C G T C AAT T TTAC G CAGAC TAT C T T T C T AGGG T T AAT C TAG 
RIGHT TERMINAL REPEAT > 

100 

CTGCATCAGGATCATATCGTCGGGTCTTTTTTCCGGCTCAGTCATCGCCC 

150 

AAGCTGGCGCTATCTGGGCATCGGGGAGGAAGAAGCCCGTGCCTTTTCCC 

200 

GCGAGGTT GAAGC G G CAT GGAAAGAGTTT GC C GAGGAT GACT GCT GC TGC 

250 

ATT GACGTT GAGCGAAAACGCACGTTTACCATGATGATTCGGGAAGGTGT 

300 

GGCCATGCA.CGCCTTTAACGGTGAACTGTTCGTTCAGGCCACCTGGGATA 

350 

CCAGTTCGTCGCGGCTTTTCCGGACACAGTTCCGGATGGTCAGCCCGAAG 

400 

CGCATCAGCAACCCGAACAATACCGGCGACAGCCGGAACTGCCGTGCCGG 

450 

TGTGCAXSATTAATGACAGCGGTGCGGCGCTGGGATAJTTA.CGTCA.GCGAGG 

500 

ACGGGTATCCTGGCTGGATGCCGCAGAAATGGACATGGATACCCCGTGAG 

550 

TTACCCGGCGGGCGCGCCTCGTTCATTCACGTTTTTGAACCCGTGGAGGA 

600 

CGGGCAGACTCGCGGTGCAAATGTGTTTTACAGCGTGATGGAGCAGATGA 

£50 

AGATGCTCGACACGC TGCAGAACACGCAGCTAGAT XAACCCTAGAAAGAT 



700 

AAT CAT AT T G TGAC G TAG GT TAAAGATAAT CATGCGT AAAAT TGACGCAT 
LEFT TESMHSIAL REPEAT 5 

GGOATCC (SEQ ID NO: 40) 

> 



Ficr. 3Cci) 



pXX-Bac 



Sequence : I - Q 35 5 2 



CTA7L*TTGT^C^GTOVATATTT^^ 

AAATCLAAAAGA* J1XGJUZC GAG.A[TAGGG / rT GAGT GTT GTTC CAGTTTGGAAJCAAGAGTC CACTAS^AAGAACGTGGA 2 °° 



CCGTCTATCAGG<SCC^TGGCCCACTACC^ 



GGGG*TCGAG^TGCCGVEAAAGCACTIJVAATC GSVAC CCTAAAGGGA) 



ao 



GTCACGCTGCGCGTAACCA( 



SOQ 



.CCACACCC<^G<:G«TAATG<^CCGCia^^ 

600 



TA^A^CGACGGCCACTGAGCGCGGCTCGTTTCAr^^ 
CA£3U7GJU\GAXGCTCC^O^ 

LEFT THRMXiCVi R£?EAT ~ > 

>WCS_of_pBSII 



900 



^CG^TG^ATCTGTAArACGACTCACTAO^^ 



AGCCCGGGGGATCCACTAGTTCEXGAfiCGGCa^C^C(XGOTGGAGCTCCAK 



100O 



ATC GGG6AGGAA£XAGCC C GTGC I ITl'TUC C 



I3O0 



L400 



CCG^ACAC^TCCGGATG<3TC^CCGJJ^GCA^ 



GGCGTAATC\T GGTT^TAGXrTGTTTCCTCTGnTGA^^ TGGGGT 



GCCTAATGACTGAGC3\A^^ 



GCGCGGGGA£.\GGCGGTT! 



1900 



130 

(XGTTGCT^GTTTTTXa^IAGSCTCCGCCCCCCTGACG^ 

21 00 
2 ZOO 

GCTTTCTCAJAGCTCAC SCTG1AGGT?JTCT CAGXTCGGTGTAGGT CGTTCGCTCCAAGCT GvGCTGTGTG.C\C SAACCCCCC GTTCAGCCCGACCt^CTSC 

2300 

rCGTCTTTG^JSrCCAAX^CGGTI^A^ 



F/Cr 3 (C&) Coni. 



^AAAAGAGTTGGll^CTCTTGJ^ 

<Col3 l_o r i gi_rs 
I 

290O 

«= _AKPC ELLIN RBSrSTANCE . " 

CCGCCTCCS^CkGrcT>JTT:AJ\3^ 
< AMPCILLZM RESISTANCE 



GGTGrC^GCTCOTCGTTTGGTATGGCCT 
< _ ; AMPCXLLIN K3SXSTANt=S_ 

TCCTTC<^TCCTCCCMCGTTGTCA&;^ 
< AMPCIE-LIN RESISTANCE 

TAAGAT GCTTTTCT GT GACT GGT GAGIACTCAAC CAAGTCATTCT&A£^?\ATAGTG1!AXGC GGC GACCGAGT TGC TCT T GCCC GGCGTCAATAC GGGAIAA. 
< , AMPCIIXXN ElSSrSTAKCE " 

3 40Q 

XACCGCGCC^^O!AGXZAGAACTTTAXAAOTGCTC^ 
< AKPCTTiTtltT RESISTANCE ' 

ATGTAACCCACTCGTGCACCCAACTGA^^ 
< AMPCTT.T.TN- RESISTANCE 

3S00 

GAAIAAGGGC GACACGGAAAT GTTGAAIAC TCAIAC TClTTCCTTTTTCAAIAXTAITGAAGCAXr TAXCAGG<jTTATTGTCTCATG AGC GGA <TW ACAX?-T 
< AMPCIItLIM RESISTANC3_ 

TGAATTt?TA!rTTAG>\AAAATAAA^ ^ SEQ ID NO " 41) 



CO.O0).HinJMI.P S U.S J ll.8 i mHI.(O.02> 



/ £UI 0.34 

I .55 

\ y^WxU linWr ^\ /^\\\ \ .67 
^ / P'goyeac \ Y^*'' ^ *' 



i» 9.3 
Fspl 3 



EcoHl ?..G 1 



p32 Pr ''f>yp70 
v/hK- link-r 10.34 Kb | 




5.35 / 
5.22 



X / j \Sna6t 4.67 



Marl 3.8 r 

\ 

Mcol 4.16 



/ Smal 4.7 
SjtXI 4.79 



5 Jul 

E-»cll 
Wot I 

/2, f 2) 



P| ismid size: I0.54kb 



FIG. 4 



pCa-sper-hs 



-orf 




FIG. -5(A) 

f 



pBSH-hs-orf 

Sequence Range: 1 to 5533 

100 

CTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTITTGTTAAATCAGC 
TCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTAT 



200 

AAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAG 
TCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAA 

300 

CCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTAATCAAGTTTTTTGG 
GGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAG 



400 

CCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGG 
AAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCG 

500 

GTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGGCGC 
GTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGAT 



600 

CGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGG 
CGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTG 



700 

TAAAACGACGGCCAGTGAGCGCGCGTAATACGACTCACTATAGGGCGAATTGGGTA 
CCGGGCCCCCCCTCGAGGTCGACGGTATCGATAAGCTATCCAGT 



FIG. 5(B) 



800 



GCAGTAAAAAATAAAAAAAAAATATGTXTTTTTAAATCTACATTCTCCAAAAAAGGG 
TTTTATTAACTTACATACATACTAGAATTGATCCCCGATCCCC 

900 

CTAGAATCCCAAAACAAACTGGTTATTGTGGTAGGTCATTTGTTTGGCAGAAGAAAA 
CTCGAGAAATTTCrCTGGCCGTTATTCGTTATTCTCTCTTTTC 

1000 

TTTTTGGGTCTCCCTCTCrGCACTAATGCTCTCTCACTCTGTCACACAGTAAACGGCA 
TACTGCTCTCGTTGGTTCGAGAGAGCGCGCCTCGAATGTTCG 

1100 

CGAAAAGAGCGCCGGAGTATAAATAGAGCGCTTCGTCTACGGAGCGACAATTCAAT 
TCAAACAAGCAAAGTGAACACGTCGCTAAGCGAAAGCTAAGCAA 

1200 

ATAAACAAGCGCAGCTGAACAAGCTAAACAATCTGCAGTAAAGTGCAAGTTAAAGT 
GAATCAATTAAAAGTAACCAGCAACCAAGTAAATCAACTGCAAC 

<hsp70_promoter 
I 

| 1300 

TACTGAAATCTGCCAAGAAGTAATTATTGAATACAAGAAGAGAACTCTGAATAGGG 
AATTGGGAATTCCTGCAGCCCGGGGGATCCTATATAATAAAATG 

M 1400 

GGTAGTTCTTTAGACGATGAGCATATCCTCTCTGCTCTTCTGCAAAGCGATGACGAG 
CTTGTTGGTGAGGATTCTGACAGTGAAATATCAGATCACGTAA 

1500 



FIG. 5(B) 



GTGAAGATGACGTCCAGAGCGATACAGAAGAAGCGTTTATAGATGAGGTACATGAA 
GTGCAGCCAACGTCAAGCGGTAGTGAAATATTAGACGAACAAAA 

1600 

TGTTATTGAACAACCAGGTTCTTCATTGGCTTCTAACAGAATCTTGACCTTGCCACAG 
AGGACTATTAGAGGTAAGAATAAACATTGTTGGTCAACTTCA 

1700 

AAGTCCACGAGGCGTAGCCGAGTCTCTGCACTGAACATTGTCAGATCTCAAAGAGG 
TCCGACGCGTATGTGCCGC A AT AT AT ATGACCC ACTTTTATGCT 

1800 

TCAAACTA TTTTT 1ACTGATGAGATAATTTCGGAAATTGTAAAATGGACAAATGCTG 
AGATATCATTGAAACGTCGGGAATCTATGACAGGTGCTACATT 

1900 

TCGTGACACGAATGAAGATGAAATCTATGCTTTCTTTGGTATTCTGGTAATGACAGC 
AGTGAGAAAAGATAACCACATGTCCACAGATGACCTCTTTGAT 

2000 

CGATCTTTGTCAATGGTGTACGTCTCTGTAATGAGTCGTGATCGTTTTGATTTTTTGA 
TACGATGTCTTAGAATGGATGACAAAAGTATACGGCCCACAC 

2100 

TTCGAGAAAACGATGTATTTACTCCTGTTAGAAAAATATGGGATCTCTTTATCCATC 
AGTGCATACAAAATTACACTCCAGGGGCTCATTTGACCATAGA 

• " 2200 

TGAACAGTTACTTGGTTTTAGAGGACGGTGTCCGTTTAGGATGTATATCCCAAAGAA 
GCCAAGTAAGTATGGAATAAAAATCCTCATGATGTGTGACAGT 



FIG. 5(B) 



2300 



GGTACGAAGTATATGATAAATGGAATGCCTTATTTGGGAAGAGGAACACAGACCAA 
CGGAGTACCACTCGGTGAATACTACGTGAAGGAGTTATCAAAGC 

2400 

CTGTGCACGGTAGTTGTCGTAATATTACGTGTGACAATTGGTTCACCTCAATCCCTTT 
GGCAAAAAACTTACTACAAGAACCGTATAAGTTAACCATTGT 



2500 

GGGAACCGTGCGATCAAACAAACGCGAGATACCGGAAGTACTGAAAAACAGTCGCT 
CCAGGCCAGTGGGAACATCGATGTT^GTTTTGACGGACCCCTT 

2600 

ACTCTCGTCTCATATAAACCGAAGCCAGCTAAGATGGTATACTTATTATCATCTTGT 
GATGAGGATGCTTCTATCAACGAAAGTACCGGTAAACCGCAAA 

2700 

TGGTTATGTATTATAATCAAACTAAAGGCGGAGTGGACACGCTAGACCAAATGTGTT 
CTGTGATGACCTGCAGTAGGAAGACGAATAGGTGGCCTATGGC 

2800 

ATrATTGTACGGAATGATAAACATTGCCTGCATAAATTCTTTTATTATATACAGCCAT 
AATGTCAGTAGCAAGGGAGAAAAGGTTCAAAGTCGCAAAAAA 

2900 



TTTATGAGAAACCTTTACATGAGCCTGACGTCATCGTTTATGCGTAAGCGTTTAGAA 
GCTCCTACTTTGAAGAGATATTTGCGCGATAATATCTCTAATA 

3000 

TTTTGCCAAATGAAGTGCCTGGTACATCAGATGACAGTACTGAAGAGCCAGTAATGA 



FIG. 5(B) 



AAAAACGTACTTACTGTACTTACTGCCCCtCTAAAATAAGGCG 

3100 

aaaggcaaatgcatcgtgcaaaaaatgcaaaaaagttatttgtcgagagcataata' 
ttgatatgtgccaaagttgtttctgactgactaataagtataat 

3200 

TTGTTTCTATTATGTATAAGTTAAGCTAATTACTTATTTTATAATACAACATGACTGT 
TTTTAAAGTACAAAATAAGTTTATTTTTGTAAAAGAGAG.AAT 

3300 

GTTT A A A A GTTTTGTT A CTTT A G AAG A AATTTTG AGTTTTTG ITTTTT TT 1 ' AAT A AAT A 
AATAAACATAAATAAATTGTTTGTTGAATTTGGATCCACTA 

3400 

GTTCTAGAGCGGCCGCCACCGCGGTGGAGCTCCAGCTTTTGTTCCCTTTAGTGAGGG 
TTAATTGCGCGCTTGGCGTAATCATGGTCATAGCTGTTTCCTG 

3500 

TGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGT 
GTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAAT 

. 3600 

TGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTA 
ATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGG 

, 3700 

CGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAG 
CGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAG 

3800 



FIG. 5(B) 



AATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAG 
GAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCG 



3900 

CCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGA 
CAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTC 



4000 

GTGCGCTCTCCTGTTCCGACCCTGCCGCITACCGGATACCTGTCCGCCTTTCTCCCTT 
CGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATC 

4100 

TCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTC 
AGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTC 



4200 

CAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTA 
GCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGG 

4300 

TGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAG 
CCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCA 

4400 

AACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCA 
GAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGG 



4500 

GTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATC 
AAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGT 



FIG. 5(B) 



<ColEl_origin 

i 

I , 4600 

TTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTA 
ATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCAT 

AMPCILLIN 

. RESISTANCE > 

4700 

CCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCAT 
CTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACC 

AMPCILLIN 

RESISTANCE ; ___> 

4800 

GGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTG 
GTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGC 

: AMPCILLIN 

RESISTANCE , > 

4900 

CGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATT 
GCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTT 

- : 'l AMPCILLIN 

RESISTANCE ; > 

5000 

CATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCA 
AAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAG 

, : AMPCILLIN 

RESISTANCE > 

5100 



FIG. 5(B) 



J 



# 



TAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACT 
GTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTAC 

AiVEPCrLLIN 

RESISTANCE. -> 



5200 

TCAAGCAAGTCATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCG 
TCAATACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAG 

: AMPC ILUN 

RESISTANCE -> 



5300 

TGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGT 
TGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATC 

r- . _AMPCIEXJN 

RESISTANCE. „ 



5400 

TTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAA 
TGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATA 

AMPC7T ,T ,TN 

RESISTANCE ^ 



5500 

CTC ATA CTCTTC CTTTTTC AAT ATT ATTG A AGC ATTT ATC AGGGTT ATTGTCTTC 
GCGGATACATATTTGAATGTATTTAGAAAAATAAACAAATAG 



GGGTTCCGCGCACATTTCCCCGAAAAGTGCCAC ' (SEQ IDNNO: 42) 



FIG. 5(B) 



pBS||-IFP2orf 




Sequence Range: 1 to 4971 

100 

CTAAATTGTAAGCGTTAATATTTTGTTAAAATTC 
TCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTAT 

200 

AAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAG 
TCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAA 

300 

CCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTAATCAAGTTTTTTGG 
GGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAG 



400 

CCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGG 
AAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCG 



500 

GTCACGCTGCGCGTAACCACCACACCCGCCGCGC1TAATGCGCCGCTACAGGGCGC 
GTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGAT 

600 

CGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGG 
CGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTG 



.700 

TAAAACGACGGCCAGTGAGCGCGCGTAATACGACTCACTATAGGGCGAATTGGGTA 
CCGGGCCCCCCCTCGAGGTCGACGGTATCGATAAGCTTGATATC 



FIG. 6(B) 



800 



GAATTCCTGCAGCCCGGGGGATCCTATATAATAAAATGGGTAGTTCTTTAGACGATG 
AGCATATCCTCTCTGCTCTTCTGCAAAGCGATGACGAGCTTGT " 



900 



TGGTGAGGATTCTGACAGTGAAATATCAGATCACGTAAGTGAAGATGACGTCCAGA 
GCGATACAGAAGAAGCGTTTATAGATGAGGTACATGAAGTGCAG 



1000 



CCAACGTCAAGCGGTAGTGAAATATTAGACGAACAAAATGTTATTGAACAACCAGG 
TTCm-CATTGGCTTCTAACAGAATCTTGACCTTGCCACAGAGGA 



1100 



CTATTAGAGGTAAGAATAAACATTGTTGGTCAACTTCAAAGTCCACGAGGCGTAGCC 
GAGTCTCTGCACTGAACATTGTCAGATCTCAAAGAGGTCCGAC 



1200 



GCGTATGTGCCGCAATATATATGACCCACITTTATGCTTCAAACTATTTm 
GAGATAATTTCGGAAATTGTAAAATGGACAAATGCTGAGATA 



1300 



TCATTGAAACGTCGGGAATCTATGACAGGTGCTACATTTCGTGACACGAATGAAGAT 
GAAATCTATGCTTTCTTTGGTATTCTGGTAATGACAGCAGTGA 

1400 

GAAAAGATAACCACATGTCCACAGATGACCTCTTTGATCGATCTrrGTCAATGGTGT 
ACGTCTCTGTAATGAGTCGTGATCGTTTTGATTTTTTGATACG ~' 

1500 

ATGTCTTAGAATGGATGACAAAAGTATACGGCCCACACTTCGAGAAAACGATGTATT 

FIG. 6(B) 



TACTCCTGTTAGAAAAATATGGGATCTCTTTATCCATCAGTGC 

, 1600 

ATACAAAATTACACTCCAGGGGCTCATTTGACCATAGATGAACAGTTACTTGGTTIT 
AGAGGACGGTGTGCGTTTAGGATGTATATCCCAAACAAGCCAA 

1700 

GTAAGTATGGAATAAAAATCCTCATGATGTGTGACAGTGGTACGAAGTATATGATA 
AATGGAATGCCTTATTTGGGAAGAGGAACACAGACCAACGGAGT 



1800 

ACCACTCGGTGAATACTACGTGAAGGAGTTATCAAAGCCTGTGCACGGTAGTTGTCG 
TAATATTACGTGTGACAATTGGTTCACCTCAATCCCTTTGGCA 



1900 

AAAAACTTACTACAAGAACCGTATAAGTTAACCATTGTGGGAACCGTGCGATCAAA 
CAAACGCGAGATACCGGAAGTACTGAAAAACAGTCGCTCCAGGC 



2000 

CAGTGGGAACATCGATGTTTTGTTTTGACGGACCCCTTACTCTCGTCTCATATAAACC 
GAAGCCAGCTAAGATGGTATACTTATTATCATCTTGTGATGA 



2100 

GGATGCTTCTATCAACGAAAGTACCGGTAAACCGCAAATGGTTATGTATTATAATCA 
AACTAAAGGCGGAGTGGACACGCTAGACCAAATGTGTTCTGTG 



, 2200 

ATGACCTGCAGTAGGAAGACGAATAGGTGGCCTATGGCATTATTGTACGGAATGAT 
AAACATTGCCTGCATAAATTCTTTTATTATATACAGCCATAATG 



2300 



FIG. 6(B) 



TCAGTAGCAAGGGAGAAAAGGTTCAAAGTCGCAAAAAATTTATGAGAAACCTTTAC 
ATGAGCCTGACGTCATCGTTTATGCGTAAGCGTTTAGAAGCTCC 

2400 

TACTTTGAAGAGATATTTGCGCGATAATATCTCTAATATTTTGCCAAATGAAGTGCCT 
GGTACATCAGATGACAGTACTGAAGAGCCAGTAATGAAAAAA 



2500 

CGTACITACrGTACrTACTGCCCCTCTAAAATAAGGCGAAAGGCAAATGCATCGTGC 
AAAAAATGCAAA^AGTTATTTGTCGAGAGCATAATATTGATA 



2600 

TGTGCCAAAGTTGTTTCTGACrGACTAATAAGTATAATTTGTTTCrATTATGTATAAG 
TTAAGCTAATTACTTATTTTATAATACAACATGACTGTTTTT 

2700 

AAAGTACAAAATAAGTTTATTTTTGTAAAAGAGAGAATGTTTAAAAGTTTTGTTACT 
TT AG A A G A AATTTTG AGTTTTTG ITTTTTTT 1' AAT AAAT AA AT 



2800 

AAACATAAATAAATTGTTTGTTGAATTTGGATCCACTAGTTCTAGAGCGGCCGCCAC 
CGCGGTGGAGCTCCAGCITTTGTTCCCrrTTAGTGAGGGTTAAT 

2900 

TGCGCGCITGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTC 
ACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAA 

• - 3000 

GCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCC 
GCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAA 



FIG. 6(B) 



>ColEl_origin 



3100 



TCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGC 
TCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGT 



3200 



ATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAG 
GAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGT 



3300 



AAAAAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGACGAGCATCAC 
AAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGAC 



3400 



TATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGA 
CCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGG 



3500 



AAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGT 
TCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCC 



3600 



GACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGAC 
TTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAG 



3700 



CGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTACA 
CTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGT 



3800 



FIG. 6(B) 



TACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTA 
GCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAA 



3900 



AAAGGATCTCAAGAAGATCCITrGATCTTTTCTACGGGGTCTGACGCTCAGTGGAAC 
GAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAA 



4000 



GGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTA 
TATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAG 



4100 



TGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCCCC 
GTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGC 
AMPCILLIN 



RESISTANCE 



4200 



CCCAGTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCA 
ATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTG 
: _AMPCEXIN 



RESISTANCE 



4300 

CAACTTTATCCGCCTCCATCCAGTCrATTAATTGTTGCCGGGAAGCTAGAGTAAGTA 
GTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTAC 

'. AMPCILLIN 

RESISTANCE - > 

4400 



FIG. 6(B) 



AGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAA 
CGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAA 
__AMPCILLIN 



RESISTANCE 



4500 

GCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTA 
TCACTCATGGTTATGGC AGC ACTGCATAATTCTCTTACTGTCA ' " ' 

: AMPCILLIN 

RESISTANCE 



4600 

TGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAG 
AATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAAT 

'. AMPCILLIN 

RESISTANCE . 



4700 

ACGGGATAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAAC 
GTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGA 

: AMPCILLIISr 

RESISTANCE 



4800 

TCCAGTTCGATGTAACCCACTCGTGCACCCAACTGAT(m'CAGCATCTTTTACTTTCA 
CCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCG 

. AMPCILLIN 

RESISTANCE ^ 



4900 



CAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTTC 
AATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGG ' 

FIG. 6(B) 



.AMPCILLIN RESISTANCE 



ATACATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCC 
CCGAAAAGTGCCAC (SEQ IDSNO: 43) ' 



FIG. 6(B) 



pEPSII-hs-orf 




PSEI 

cj EcoRl 
.BamHl 



^ FIG. 7<A) 



pBSII-os-orf 



FlQr. ICQ) 



Sequence Range: 1 to 5533 

AAATCXAAAGAAIAGAC C GAGJlHIAGGG-TT GAGTGT TGTTC CAGTTTGGAACAA/SA.GTC CACtCATT AAAGAAC GTGGACT CCAACGTCAAAGGGCGXA^V^ 
CCGTCTATCAGGGCGAXGGCCCACrACGTGAACCA^ 

GTCACGCTC^GCGrAACCACCA^ 
CGGTGCGC^CrCTTC^TTACGCC^ 



CTAGAATCCCAAAACAAACT GGTIATTGTGGIAf^TTIAlTT< 



tttttgg<;tctccctctctgc^^ 



90O 

CTTATTCTCTCTTTTC 
LOOO 

"GCTLIUt'I It^GTTCGAGJV^AGCGX^GCCTCGAATGTTCG 

110a 

rCGCTViGC GAAAGCTAAGCAA 



<hap70_proiBoTiar 
I 

rACTCAAATCTKOAI^rAAT^VXTGAAia 

*»«K=P«»araBWCCBIia«a^^ 

TCGTGACACG^JVIGAAGATGiW»iATCX^GCTTTCTT^^ 

TTCGAG^^^O^TTA^CCT 

TGAACAGTTACTtG^TTTTAGAGGACGGXGtCCGTTl^^ 
GGT^GAAGTATATGATXAATGGAATC^CTTATTTGGGA^ 



2 400;_ 
G J T^TA^.GTT?J^C\TTGT 



FlCr 7( B) Conh 



GGGAACCGTGCGAXCAAACAAACTCGAG^ 
ACTCTCG^CrC^ZAl^AAAC^ 

T GGTTAX G11AJTTATAATCAAAC1!AAAG<SC G GAGT GGACAC GCTAGACCAAATGTG TTC7GT GATGAC CT 5<IAGT AGGAAGAC GAAX? GGT GGO C t^TGG c 

ArCA^GTACt^ATGATAAACAX^ 

TTTATGAGJW,CCTTTACArGAGCCTGACGTCATC 

T TTTGC CAAAT'SAAiOT GCCTGG2EACATCAGATG 

AA&GGCAAATGCATC G^t^CAAA?LAATGCAAAAAAGTTAXT T G T C G A GAGCATflAXATTGAX^ZX^SCCAAA G^ E 1 GTJ J ICTGACTGACTAATAAGTATAA^ 

TTGTTTCTA!nL^GT7V3^ 

GTTIAAAAGTTTTGTTACTTTAG^ 

TGTGAAATTGTTATCCGCTCAXAATTCCA^ 

^^C^TT GCGCTCAC TGCCCGCTTTCCAGTC GGGAAACCTGTCGTGCXIAGCTGCATTAATGAATCGGCCAACGCGC GGG>GAGAG<^GGTTrTGCGTAxrG^G 
C^TCTTCCGCTTCCTCGCTCI^TG^ 

AATCAGGGGATAACGCLAGGAAAGAACATTGT CGCGTTGCTl 3 ° 



CCCCCCTGACGAG 



AIAfiGCTCCG 
3900 



400O 

GTGCGCTCTCCTGKITCCC^CCTG^^ 

4100 

TCA GTTCGGTGTAGGTCGTTCGCTCCA?J^TG<^ 

42QO 

CAACCCGGTAAGACACG^XITTATC 

43 OO 

TGGCCTAACTACGGCTAC^TA^ 

_ 4,400 
AACAAACO^CGCTG^TAG<:G^TG^TTrXrrTGTT7G<lAAG 

4500 

GTCTGACG^rCAGTGGAACGAAAACTCACGTTAAGGGA3OT^ 

I 

I 4600 
T TT AAAT CAAT CTAAAG T AT AT AT GAGT AAACTT GGT C T GACAGT TAG CAATGCT 1AAT CAGTGAGGCAC CTAX CTCAGCGATC T GT CTAT T*I C GTTCAT 

AWPCII^CrN RKSIST!AMCS > 

4700 

c GATAGTTGCC TGrilTCCC C GTCGT GXAGATAAC TACGAT ACGGGAGGGC TTACCATCTGGCCCCAGTGCTGCAAT GATAC CGC GAGACCCAC GCTCACC 
AMPC ILXtTN RESISTANCE 

-- . ' :_ - 43 oo 

GG^T CC AGA??TATCJU=CAA£AAAC CAGCCAGCCGGAAGGGCCGAGC GCAGAAGTGGTCC IGCAACTTTA'TCCGC CTC-TATCO jGTCT^ JTT_~_- JTGTTGC 
AK5C:iLKI RESISTANCE > 



FlO. 7CB) Conf. 

— AMPCILLIN RESISTANCE ■ ^^it&rTT^IATGGCtr 

■ _ > 

AMFCILLXN KZSISTANC5 ^^nwicA^g 



TGCTCATCATTGGAAAACGTTCTTCGGGGCGA^^ 530 Q 

. 1 ___AMPcn^LtN RELS ISHANC2 



> 



, ■ > 



'AAACAAAXAG 



GGGTTCCGCGCAOVTTTCCCCGAAAAGTGCCAC ( SEQ ID NO I 42 ) 



pBSIMEl -orf 



3 0r 



Kpnl 
Apal 
'Xhol 
Sail 



Xbaly 
Speli 
BamHI 



■ PsW 



FIG- 8C 



r Pstl 
BamHI 



BgMI 



Sequence Range: 1 to 5523 



100 



CTAAATTGTAAGCGTTAATATTTTGTTAAAATTCGCGTTAAATTTTTGTTAAATCAGC 
TCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTAT 



200 



AAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAG 
TCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAA 



300 



GCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCTAATCAAGTTTTTTGG 
GGTCGAGGTGCCGTAAAGCACTAAATCGGAACCCTAAAGGGAG ' " 



400 



CCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGG 
AAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCG 



500 



GTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAATGCGCCGCTACAGGGCGC 
GTCCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGAT 



600 



CGGTGCGGGCCTCTTCGCTATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGG 
CGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTTG 



700 



TAAAACGACGGCCAGTGAGCGCGCGTAATACGACTCACTATAGGGCGAATTGGGTA 
CCGGGCCCCCCCTCGAGGTCGACGGTATCGATAAGCTTCGATGT 



FIG. 8(B) 



800 



CTTTGTGATGCGCCGACATTTTTGTAGGTJATTGATAAAATGAACGGATACAGTTGC 
CCGACATTATCATTAAATCCTTGGCGTAGAATTTGTCGGGTCC 



900 



ATTGTCCGTGTGCGCTAGCATGCCCGCTAACGGACCTCGTACTTTTGGCTTCAAAGG 
TTTTGCGCACAGACAAAATGTGCCACACTTGCAGCTCTGCATG 



1000 



TGTGCGCGTTACCACAAATCCCAACGGCGCAGTGTACTTGTTGTATGCAAATAAATC 
TCGATAAAGGCGCGGCGCGCGAATGCAGCTGATCACGTACGCT 



1100 



CCTCGTGTTCCGTTCAAGGACGGTGTTATCGACCTCAGATTAATGTTTATCGGCCGA 
CTGTTTTCGTATCCGCTCACCAAACGCGTTTTTGCATTAACAT 



1200 



TGTATGTCGGCGGATGTTCTATATCTAATTTGAATAAATAAACGATAACCGCGTTGG 
TTTTAGAGGGCATAATAAAAGAAATATTGTTATCGTGTTCGCG 



1300 



attagggcagtataaattgacgttcatgttggatattgtttcagttgcaagtGaatt 
cctgcagcccgggggatcctatataataaaatgggtagttctt 

1400 

TAGACGATGAGCATATCCTCTCTGCTCTTCTGCAAAGCGATGACGAGCTTGTTGGTG 
AGGATTCTGACAGTGAAATATCAGATCAGGTAAGTGAAGATGA 

1500 

CGTCCAGAGCGATACAGAAGAAGCGTTTATAGATGAGGTACATGAAGTGCAGCCAA 

FIG. 8(B) 



) 



CGTCAAGCGGTAGTGAAATATTAGACGAACAAAATGTTATTGAA 



1600 



CAACCAGGTTCTTCATTGGCTTCTAACAGAATCTTGACCTTGCCACAGAGGACTATT 
AGAGGTAAGAATAAACATTGTTGGTCAACTTCAAAGTCCACGA ^ U ^ UALTATT 



1700 

GG ^ AG ^ 

TGTGCCGCAATATATATGACCCACTTTTATGCTTCAAACTATT 

1800 

TTTTACTGATGAGATAATTTCGGAAATTGTAAAATGGACAAATGCTGAGATATCATT ■ 
GAAACGTCGGGAA^CXA^GA-CA^SQIGGTA^A^H^GG^FGA-GAG G 



1900 



i A I GA i GATGAAATCTATGCmCmGGTA ^ CT GGTAATGACAGCAGTGAGAAAA 
GATAACCACATGTCCACAGATGACCTCTTTGATCGATCTTTGT 



2000 



CAATGGTGTACGTCTCTGTAATGAGTCGTGATCGTTTTGATTrnTGATACGATGTCT 
TAGAATGGATGACAAAAGTATACGGCCCACACTTCGAGAAAA 



2100 



CGATGTATTTAGTCCTGTTAGAAAAATATGGGATCTCTTTATCCATCAGTGCATACA 
AAATTACACTCCAGGGGCTCATTTGACCATAGATGAACAGTTA 



2200 



TATGGAATAAAAATCCTCATGATGTGTGACAGTGGTACGAAGT 



2300 



FIG. 8(B) 



) 



ATATGATAAATGGAATGGCTTATTTGGGAAGAGGAACACAGACCAACGGAGTACCA 
CTCGGTGAATACTACGTGAAGGAGTTATCAAAGCGTGTGCACGG 



2400 

TAGTTGTCGTAATATTACGTGTGACAATTGGTTCACCTCAATCCCTTTGGCAAAAAA 
CTTACTACAAGAACCGTATAAGTTAACCATTGTGGGAACCGTG 



2500 

CGATCAAACAAACGCGAGATACCGGAAGTACTGAAAAACAGTCGCTCCAGGCCAGT 
GGGAACATCGATGTTTTGTTTTGACGGACCCCTTACTCTCGTCT 



2600 

CATATAAACCGAAGCCAGCTAAGATGGTATACTTATTATCATCTTGTGATGAGGATG 
CTTCTATCAACGAAAGTACCGGTAAACCGCAAATGGTTATGTA 



2700 

TTATAATCAAACTAAAGGCGGAGTGGACACGCTAGACCAAATGTGTTCTGTGATGAC 
CTGCAGTAGGAAGACGAATAGGTGGCCTATGGCATTATTGTAC 

2800 

GGAATGATAAACATTGCCTGCATAAATTCTTTTATTATATACAGCCATAATGTCAGT 
AGCAAGGGAGAAAAGGTTCAAAGTCGCAAAAAATTTATGAGAA 



2900 

ACCTTTACATGAGCCTGACGTCATCGTTTATGCGTAAGCGTTTAG.AAGCrCCTACTTT 
GAAGAGATATTTGCGCGATAATATCTCTAATATTTTGCCAAA 

3000 

TGAAGTGCCTGGTACATCAGATGACAGTACTGAAGAGCCAGTAATGA.AAAAACGTA 
CTTACTGTACTTACTGCCCCTCTAAAATAAGGCGAAAGGCAAAT 



FIG. 8(B) 



3100 



GCATCGTGCAAAAAATGCAAAAAAGTTATTTGTCGAGAGCATAATATTGATATGTGC 
CAAAGTTGTTTCTGACTGACTAATAAGTATAATTTGTTTCTAT 

3200 

TATGTATAAGTTAAGCTAATTACTTATTTTATAATACAACATGACTGTTTTTAAAGTA 
CAAAATAAGTTTATTTTTGTAAAAGAGAGAATGTTTAAAAGT 



3300 



TTTGTTACTTTAGAAGAAATTTTGAGTTTTTGTT^ 
AAATAAATTGTTTGTTGAATTTGGATCCACTAGTTCTAGAGC 



3400 



GGCCGCCACCGCGGTGGAGCTCCAGCTTTTGTTCCCTTTAGTGAGGGTTAATTGCGC 
GCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTG 



3500 



TTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCT 
GGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGC 



>CoLEl_origin 
I 

3600 

TCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGC 
CAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCG 

' 3700 

CTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAG 
CTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGA 

3800 



FIG. 8(B) 



TAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAA 
AAGGCCGCGTTGCTGGCGTTTTTCCATAGGCTCCGCCCCCCTGAC 

3900 

GAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATA 
AAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTC 



4000 

CTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGt 
GGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGT 



4100 

GTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGGACGAACCeCCCGTTCAGCCCGACCG 
CTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTA 



4200 

AGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAG 
GTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACT 

4300 

ACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCT 
TCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCAC 



4400 

CGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGG 
ATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCT 



4500 

CAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGAT 
CTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAA 



FIG. 8(B) 



4600 

TCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAA.TCAGTGAGG 
CACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGC 

: AMPCILLIN 

RESISTANCE . > 

4700 

CTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAG 
TGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGAT 

: AMPC TTTTN 

RESISTANCE > 

4800 

TTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAAC 
TTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTA 

: AMPCII T TNT 

RESISTANCE ' > 

4900 

GAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAArGTTGTTGCCATTGCT AC AGGC A 
TCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTC 
___AMPCILLIN 



RESISTANCE 



5000 

CGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGT 
TAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCC 

AMPCILLIN 

RESISTANCE - ^ 



5100 

GCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCAT 



FIG. 8(B) 



CCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGT 

AMPCILLIN 

RESISTANCE ■ 



5200 



CATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGG 
AT A ATACC GC GC C A C AT AGC AG A A CTTT A A AA GTGCTC ATC AT ' ~ ~ 

AMTTTTTTM 



RESISTANCE 



5300 

TGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAG 
TTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCT 

— : AMPCILIJN 

RESISTANCE ' . 



5400 

TTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAA 
AAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCT 

— '■ . AMPCILXJN 

RESISTANCE 



5500 



TCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACAT 

atttgaatgtatttagaaaaataaacaaataggggttccgcg 
cacatttccccgaaaagtgccac (seq id no: 44) 



FIG. 8(B) 



p3xf?3-DsRed-orf 

CMV- 




1 ^ 
1 

CD 



°0, 



009G 00££ 




Pstl 



FIG. 9( A ^ 



P«P3-»«B«l-orf 9 (B) Sequence Ran**: ! to S5a< 



LOO 

T AGTTATTAATAGTAAT CAA TT ACG GGGTCAT TAGT TCAIAGCCCAT ATAT GGAG TT CC GC GT TACATAAC T TACGGTAAATGGCC C GC CTGGG TGAC C G 
_ L QSV PFtnMOTKH ^^^^ ~ " > 

CCCAACG^CCCC^CCATTGT^^ 

CMV BR0M0T2R ^ > 

AAACC GCC CAC ITGGCAGT ACATCAAGI GTAT CAT AXGC CAAGTACGC CC CC TATTGACGTCAATGACGGTAAATGGCCCGCC T GGGAT TATTGC C CAG^A 
QfV PROMOTSR ' > 

C ATGAC CI TAT GGGAC TTT CCTACTTGGCAGTACATGTACGT^ 

, . CMV PROROTS R > . 

TAGCGGTTTXiSwCTCACC^SGGATTTC 

CMV PROMCT5R > 

60 0 

ACAACTCCGCC CCATTGACGCSxAAXGGC^^ 

; CKV PROMOTER > 

: PIGGYBAC ORF ■ ^ 

300 

TCTGACAGTGAAAIAICAJSAT<3VCGT^UU^ 

PIGGYBAC ORF ' > 

900 

GCGGTAGTG7wAATATTAGACGAACA2^AJGTX?Jrr« 

. PISGY3AC ORF . ____ > 

1000 

PIGGYBAC ORF __ > 

11O0 

CGC ^ 3 M^^^ GAGAT!ATCATTGAP_A.C 
PIGGYBAC ORF '_ ' > 

1200 

CJTC*5GGAAXCTWX3\CAGGTGC^^ 

PIGGYBAC ORF > 

L3O0 

CCUVC\XGTCCACAGATGACCTCTTTGATCr^^ 

P IGGYBAC ORF . > 

1400 

ATGGAT GACAAAAGXATACGGCCCACACTTC GAGAAAAC^^ CTCTTTATCCAXCAGTGC^ACiAAAAXT 
'_ PIGGYBAC OR? - > 

ISOO 

ACACTCCAGGG^TC^^TGACCLATAGATGAACAG 

PIGGYBAC ORF > > 

AATAAAAATCdTCATGATGTGTGACAGTGGTA^ 

, _ PIGGYBAC ORF __ > 

1700 

GAAIAC TAG GT QAAGGAGTT ATCAAAGC C T GTGCAC GGIAGTTGT CGTA\TAX TAC GT GT GACAATTGGTTCAC CT CAA.TC C CTTETT GGCAAAAAACT TAC 
PIGGYSAC ORF > 

1800 

T ACAAGAAC C GTATAAG TT A^CCAX TGI GGGAAC C GT GC GAT CA\ACAAAC GC GAGATAC C GGAAGTAC 7 GAAAAACAG TC GC T C CAGGCCAGT GGGAA C 



) 



P70. 9CB) Conf- 



ATCGAaGrTTTGTTTTGACGGACCCCTTAC^^ 

PIGGYBAC ORF 

A rCAAC GAAAG T ACC GGTAAAXI CGCAA\r GGTTVT GTAXTAIAAT CAAAC TAAAGGCGGAGTGGA CAC GC T!AGACCAAAT G TGTT C TGT GA 1 ^ ACC TGCA 
PXGGYHAC ORF 



GTAGGAAGACGAA3MGTGGCC1M GTAGCAA. 
: : PIGGYBAC ORF "• 2 

G GGAGAAAAGGTTCAAAGTC GCLAAAAAATT TATGAGAAACCTTTACAXGAGCCT GAC GT C AT C GTT TAX GCGTAAGC GT T TAGAAGOTC CTACT ITGAAG- 
; PXGGYHAC ORF ^ 



2300 

AGATATXTGCGCGAXAA3ATCTCtrAA3!ATTTTG^ 

_ PIGGYBAC ORF ; ~~ ' ^ 

2400 

GTAXTTACTG^CCCTTCTAAAATJUVGC^ 

PIGGYHAC ORF > 

TTGTTTCTGACTGACTAAIAAGTATAA^^ 



'TACAAA 

PIGGYBAC ORP > 



ATAAGTTTAXTTTTGTAAAAGAGAGAATGT^^ 2 6 ° ° 



PIGGYBAC ORF 



AAATT G TTT GT T GAATTTGGJVT CTC GAGGT TCCCA.CAAT GGTTAATTOGAGCT C GCC CGGGGATCXAATTCAAT TAGAGAC TAATTCAATTAGAGCTAAT 

- ^__3XP3 PROMOTER. - 

PIGGYHAC OR? > - — _ . 

2800 

T^ZAATTAGGAXX:CAAG<rrtuX^ATTT^ 

' 3XP3 PROMPTE R > 

_ 2900 
GTGAACACGrCGCTAAGCGAAAGCTAAGOAATA^ 

. : 3XP3 PROMOTER > 

3000 



DSRED GENE 



3100 

GAfiTTCATGCGCTTCAAGGTGCGCATGGAGGtK^C^^ 

D SRED GENS > 

3200 

CC&TGAAGCTGAAGGrGACCAAGGGCGGCCCCCTGCCCTTCGCC^ 

_ D SRED GENE > 

3300 

CGCCGACATCC CCG^TACAAGAAGCTGTCC^ 

' . 03 RED GENE _ > 

3400 

GACTCCT C C CT GXIAGGACGGCTGCTTtZATC TAC AAGGTGAAGTTCATCGGG GT GAACTTC CCCTCCGACGGCCCCGTAATGCAGAAGAAGACCATGGGCT 
, DSRED GESE > 

3500 

GGGAGGCCTCCACCGAGCGCCT GTACCCCCGC GAC GGC GT GCTGAAGGGC GAGATCCACAAGGCC C TG AAGCTGAAGGACGGCGGGCAJZTACCTGGTGG A 
DSRED GENE > 

350 0 

GTTCAAGTCCATCTACATGGCCAA^ 

. *"' DSRED GE2JE "~ ~ ...... > 



F/G-. SCB) C^rf. 



TACTTGCTTTAAAAAAOT^ 

>£ l_s ingl e- r end_DNA_or i gin 
I 
I 

AA-TGTAAGv-GTTAA^TTTTGTTAAAATTCGCOTILV^ 
TCLAAAAGAATAGACCQAGAXAGGGTTG^^ 



c CGAmAGAGCTTGACGGGGAAAGCCGG^ 

ACGCTGCGCGTAACCACCACMCCGC 

>Haccr i ai. _j r amor, e r_fo r_ t x?r&5 5ioiB_ojf_Kan_r eaistanca g ene 

I 
I 



TATTj 



TTCTAAATACATTCAAATXTGTATCCGCTCAr^ 



>SV40_earlyj)romot«sr_and_oriyin w of_replicition 
AACCAGCTGTG^ATGTGTGTC^TTAG<OT 
<*^TGG*AAGTCCCCAGOT 
GCCCCTAACrCCGCCCA^^ 
CAGAAGTAGTOAGGAGGCTT^ 



CAC G<ZAGGTTC TCCGGC C GCTTGGG-TGGAGAGGCTAT TC GGCTAT GACT GGGCAC AACAGACAAT C TGCTCTGA5TGCC GCC GTGTT CCGGCTXGTCAG 
; — : KANftMXCXN RESISTANCE GENE > 

5100 

ITTTTGTCAAGACt^ACCTGl^CGG^GCCCTGAATGAACTGCAAfiAC GAGGCAGXIGCGGCTAICGTGGCXGGCC5\CGACGGG 
RHSISrANCH GENE 

4 ,- ■ . •> 



CGTTCCtCTGCOCAGOTGTGCTCGACGTTGTCACTGJ^ 

_ ! KANAMXCXN RESISTANCE G2NB > 

GCTCCTGCCGAGAAAGT>^CC?LTCATCSGCrTGAXGCA 

KANMfifCIM RES Z5TANCS GENE ; > 



BGXNAMTCrN RESISTANCE GENE . 

C^AGGCGAG^TGCCCGiUrGGCGAGCATCTCGTC 



_3ANAH*CIN RSSISTANCS GEN2 



5600 

GACTGTGGOCGGCTGGGTGrGGC GG AC CG<! TAT CAG^ACATAGCGTTGGC TACOC GTG?JIIATTGCTG?J^GAGCTTG<^GGCGAAXGGGC TGAC CGCTTC C 
1 = KAN?.KYCIN RESISTANCE GENS " > 



Flo. 9C8) cml 



TCGTGCTTT^GGT^CGCCGCTCCCGATTCG^ 

RESISTANCS GZNB ' > 

<^CAAGCGACGCCCAACCTGCC.VTC^ 

5900 

TGATCCTCC^GCOTGGATCTCMGCT©^^ 

>He rpc3_ 3 iiqplex_viru3_ (HSV) _rhyraidine_klnaae_ (TK) _polyA_3igxwXi 

I 

' 5000 
TATGAC GCCAATAAAAAllACAGAAIILAAAAC GCACGGTG x T GGGTC GTTTGTrTCATAAAC GC GGGG TT OSG TCXI CAJSGi^TGCC-^-CTCTOTC GAT AC CCCA 

6T.0 0 

CCGAGACCCCATTG<3G<3CCAAIAC^^ 

' 6200 

CGGCAGGCCCTGCCATJUSCXTQ^GCrTAC^ 

>pUC_j>l aamid_r epl ica ti ca^or igin 
[ 

I 6300 
TAATCT CAT GAC CAAAAT CC CTTAAGG^TGAGTCTTT C GT "PC CACTGAGC GT CAGAC CCCGT AGAAAAGATCAAAC^GJVT CTTCTT GAGATCC TTTT TTT C T G- 

6400 

CGCGTAATCTGCTGCTTGCJ\AACAAAAAAACC^^ TACCAACTCTTTTTCC GAAGGTAAGTGGCT 

65 0 0 

T CAG^_AC^GGGCA.GATACCAAAXAC TGTCCTTCTS^TG'rAGCCGTAGTTAGGCCA.CCAC TTtlXAGAACTC TGTAG^-ACCGCCTACATAC CTCGC TCT GCT 

6600 

AATCCTGrTACCAGTCGXTTGCTCCCACi^^ 

6700 

AO=GGGWTTCQTGOVCACAGCCCAGC7irr^^ 

ea 00 

G^M^AAG<^GGACAG«lMCCGGTAAGCa5G<^^ 

S900 

CGGOTTTKZGCCACCTCTGACTTC^V^ 

CTGGCCTCTT^TIX^CTTT^^ ( SEQ ID NO I 45) 



pCRII-iTR 



20 o 






n resistance 1 



Pstl 



Bgfll 



FIG. 10 OO. 



Sequence Range: 1 to 4613 



100 

agcgcccaatacgcaaaccgcctctccccgcgcgttggccgattcattaatgcagct" 
ggcacgacaggtttcccgactggaaagcgggcagtgagcgcaa 



200 

CGCAATTAATGTGAGTTAGCTCACTCATTAGGCACCCCAGGCTTTACACTTTATGCTT 
CCGGCTCGTATGTTGTGTGGAATTGTGAGCGGATAACAATTT 

300 

CACACAGGAAACAGCTATGACCATGATTACGCCAAGCTTGGTACCGAGCTCGGATC 
CACTAGTAACGGCCGCCAGTGTGCTGGAATTCGGCTTGGATCCC 



400 

ATGCGTCAATTTTACGCAGACTATCTTTCTAGGGTTAATCTAGCTGCATCAGGATCAT 
ATCGTCGGGTCTTTTTTCCGGCTCAGTCATCGCCCAAGCTGG 

LHQDHIVGSFFRLSHRPS W> 

B (CAPSID COMPONENT;533); CODON_START=L 

DB_XREF=P > - . ■' 

LHQDHIVGSFFRLSHRPS W> 
PROCESSED B; CODON_START=l 

[SPLIT] > 

RIGHT TERMINAL REPEAT > 



500 

CGCTATCTGGGCATCGGGGAGGAAGAAGCCCGTGCCTFTTCCCGCGAGGTTGAAGC 
GGCATGGAAAGAGTTTGCCGAGGATGACTGCTGCTGCATTGACG 

RYLGIGEEEARAFSREVEAAWKEFAEDDCCC 

I D> 

B (CAPSID COiVIPONENT;533); CODON_START=l; 

DB_XREF=PID:G215108; TRA [SPLIT] > 



FIG. 10(B) 



) 



RYLGIGEEEAR AFSR'EVEAA WKEFAEDDCCC 

I D> 

PROCESSED B; CODON_START=l 

rSPLITI ■ _> 



600 

TTGAGCGAAAACGCACGTTTACCATGATGATTCGGGAAGGTGTGGCCATGCACGCCT 
TTAACGGTGAACTGTTCGTTCAGGCCACCTGGGATACCAGTTC 

VERKRTFTMMIREGVAMHAFNGELFVQATW 
D T S S> " ■ 

B (CAFSID COMPONENT;533); CODON_START=l; 

DB_XREF=PID:G215108; TRA [SPLIT] > 

VERKRTFTMMIREGVAMHAFNGELFVQATW 
D T S S> 

. PROCESSED B; CODON_START=l 

[SPLTT] ; >. 



700 

GTCGCGGCTTTTCCGGACACAGTTCCGGATGGTCAGCCCGAAGCGCATCAGCAACCC 
GAACAATACCGGCGACAGCCGGAACTGCCGTGCCGGTGTGCAG 

SRLFRTQFRMVSPKRISNPNNTGDSRNCRAG 

V Q> 

B (CAPSID COMPONENT;533); CODON_START=l; 

DB_XREF=PID:G215108; TRA [SPLIT] > 

SRLFRTQFRMVSPKRISNPNNTGDSRNCRAG 

V Q> 

. PROCESSED B; CODON_START=l 

[SPLIT] > 



800 

ATTAATGACAGCGGTGCGGCGCTGGGATATTACGTCAGCGAGGACGGGTATCCTGG 
CTGGATGCCGCAGAAATGGACATGGATACCCCGTGAGTTACCCG. 

INDSGAALGYYVSEDGYPGWMPQKWTWIPR 

E L P> 



FIG. 10(B) 



B (CAPSID COMPONENT;533); CODON_START= 1 ; 

DB_XREF=PE):G215108; TRA [SPLIT] > 

INDSGAALGYYVSEDGYPGWMPOKWTWIPR 

E L P> 

PROCESSED B ; CODON_START=l 

rspLm : > 



900 

GCGGGCGCGCCTCGTTCATTCACGTTTTTGAACCCGTGGAGGACGGGCAGACTCGCG 
GTGCAAATGTGTTTTACAGCGTGATGGAGCAGATGAAGATGCT 

GGRASFIHVFEPVEDGQTRGANVFYSVMEOM 
K M L> 

B (CAPSID COMPONENT;533); CODON_START=l; 

DB_XREF=PID:G215108;TRA [SPLTr]_ > 

GGRASFIHVFEPV ED GQTRGANVFYS VMEOM 
K M L> 

PROCESSED B; CODON_START=l 

rSPLITI ___> 



1000 

CGACACGCTGCAGAACACGCAGCTAGATTAACCCTAGAAAGATAATCATATTGTGA 
CGTACGTTAAAGATAATCATGCGTAAAATTGACGCATGGGATCC 
D T L Q N T Q> ( SEQ ID NO: 47) 

B "(CAPSID COMPO > 

D T L Q N T Q> (SEQ ID NO: 47) 
PROCESSED B; CO > 

: LEFT TERMINAL 

REPEAT > ' 



1100 

AAGCCGAATTCTGCAGATATCCATCACAGTGGCGGCCGCTCGAGCATGCATCTAGAG 
GGCCCAATTCGCCCTATAGTGAGTCGTATTACAATfCACTGGC 



1200 



FIG. 10(B) 



CGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCT 
TGCAGCACATCCCCCTTTCGCCAGCTGGCGTAATAGCGAAGAG 

1300 

GCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAATGGGACGC 
GCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGC 

1400 

GCAGCGTGACCGCTACACIT(3CCAGCGCCCrAGCGCCCGCTCCTTTCGCTTTCTrCCC 
TTCCTTTCrCGCCACGTTCGCCGGCrTTCCCCGTCAAGCTCT 

1500 

AAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGAGCTTTACGGCACCTCGACCGCAA 
AAAACTTGATTTGGGTGATGGTTCACGTAGTGGGCCATCGCCC 

1600 

TGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCT 
TGTTCCAAACTGGAACAACACTCAACCCTATCGCGGTCTATT 

1700 

CTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCT 
TTAACAAATTCAGGGCGCAAGGGCTGCTAAAGGAACCGGAAC 

1800 

ACGTAGAAAGCCAGTCCGCAGAAACGGTGCTGACCCCGGATGAATGTCAGCTACTG 
GGCTATCTGGACAAGGGAAAACGCAAGCGCAAAGAGAAAGCAGG 



1900 

TAGCTTGCAGTGGGCTTACATGGCGATAGCTAGACTGGGCGGTTTTATGGACAGCAA 
GCGAACCGGAATTGCCAGCTGGGGCGCCCTCTGGTAAGGTTGG 
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2000 

GAAGCCCTGCAAAGTAAACTGGATGGCTTTCTTGCCGCCAAGGATCTGATGGCGCA 
GGGGATCAAGATCTGATCAAGAGACAGGATGAGGATCGTTTCGC 



2100 

ATGATTGAACAAGATGGATTGCACGCAGGTTCTCCGGCCGCTTGGGTGGAGAGGCT 
ATTCGGCTATGACTGGGCACAACAGACAATCGGCTGCTCTGATG 



2200 

CCGCCGTGTTCCGGCTGTCAGCGCAGGGGCGCCCGGTTCrrTTTTGTCAAGACCGACC 
TGTCCGGTGGCCTGAATGAACTGCAGGACGAGGCAGCGCGGCT 

2300 

ATCGTGGCTGGCCACGACGGGCGTTCCTTGCGCAGCTGTGCTCGACGTTGTCACTGA 
AGCGGGAAGGGACTGGCTGCTATTGGGCGAAGTGCCGGGGCAG 

2400 

GATCTCCTGTCATCTCGCCTTGCTCCTGCCGAGAAAGTATCCATGATGGCTGATGrA 
ATGCGGCGGCTGCATACGCTTGATCCGGCTACCTGCCCATTCG 



2500 

ACCACCAAGCGAAACATCGCATCGAGCGAGCACGTACTCGGATGGAAGCCGGTCTT 
GTCGATCAGGATGATCTGGACGAAGAGCATCAGGGGCTCGCGCC 

2600 

AGCCGAACTGTTCGCCAGGCTCAAGGCGCGCATGCCCGACGGCGAGGATCTCGTCG 
TGATCCATGGCGATGCCTGCTTGCCGAATATCATGGTGGAAAAT 

2700 

GGCCGCTTTTCTGGATTCAACGACTGTGGCCGGCTGGGTGTGGCGGACCGCTATCAG 
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GACATAGCGTTGGATACCCGTGATATTGCTGAAGAGCTTGGCG 

2800 

GCGAATGGGCTGACCGCTTCCTCGTGCTTTACGGTATCGCCGCTCCCGATTCGCAGC 
GCATCGGCTTCTATCGCCTTCTTGACGAGTTCTTCTGAATTGA 

2900 

AAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGG 
CATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGA 

3000 

AAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACTGGAT 
CTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACG 

3100 

TTTTCCAATGATGAGCACTTTTAAAGTTCTGCTATGTeATACACTATTATCCCGTATT 
GACGCCGGGCAAGAGCAACTCGGTCGCCGGGCGCGGTATTCT 

3200 

CAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCAT 
GACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATA 

3300 • 

ACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCT 
TTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTG 

' 3400 

GGAACCGGAGCTGAATGAAGCCATACCAAACGACGAGAGTGACACCACGATGCCTG 
TAGCAATGCCAACAACGTTGCGCAAACTATTAACTGGCGAACTA 

3500 



FIG. 10(B) 



CTTACTCTAGCTTCCCGGCAACAATTAATAGACTGGATGGAGGCGGATAAAGTTGCA. 
GGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTG 

3600 

CTGATAAATCTGGAGCCGGTGAGCGTGGGTCTCGCGGTATCATTGCAGCACTGGGGC 
CAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGG 

3700 

GAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCAC 
TGATTAAGCATTGGTAACTGTCAGACCAAGTTTACTCATATATA 

3800 

CTTTAGATTGATTTAAAACTTCATTTTTAATTTAAAAGGATCTAGGTGAAG 
TTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGT . 

3900 

TCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTT 
TTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCT 

4000 

ACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCITTTTCCGAAGGTAAC 
TGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTG 

4100 

TAGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCT 
CTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGT 

4200 

CGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCG 
GGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAAC 
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4300 



GACCTACACCGAACTGAGATACCTACAG.CGTGAGCATTGAGAAAGCGCCACGCTTC 
CCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTC 

4400 

GGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAG 
TCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGT 

4500 

GATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTA 
CGGTTCCTGGCCirrTGCTGGCCITTTGCrCACATGTTCTTTCC 

4600 

TGCGTTATCCCCTGATTCTGTGGATAACCGTATTACCGCCTTTGAGTGAGCTGATACC 
GCTCGCCGCAGCCGAACGACCGAGCGCAGCGAGTCAGTGAGC 

GAGGAAGCGGAAG ( SEQ ID NO: 46) 
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(PZJ -Bac-3YT? 



3aquanc* Range: 1 co 3 9 99 



10 o 



ACCSAAGTATACACT1LA\AXTC?\GTGCAXS^ 



20(2 



G AGCCTTGGGTGCAGC CTTGG^TGAAAACTCCCAAA i ' TTt/ gGATACQ 



30O 



'GA/EACCCACTTIIAATGJkXTCGCAOTGGAA/SGCrGCACCTGCAAAA^ 

400 

AGGAGGCGAGTCAACGO\£AXGCCGTACCTAGTAA^^ 

soo 

AAGTTT GAATAXSX?lAAAAAAATCATT TAXGTAAACAATAAC GTGACTGTGCGTTAGGT C CTGTT CATTGT TTAAT GA\AVEAAGAGC TTGAGGGAAAAAA 

60 0 

TTCGTAarTTGGAGXACGA^ATGCGTCGTTTAGAGC^ 

700 

ACTTAATCGCCTTGC2USCACATCCCCCTTO 

300 

GA\XGGCGCTrTGCCTGOTTTCCGGCACCAGAAGCGGTGCCGGAA^ 
GG<JU^TGC^CGGTTACGATGCXSCCCATCT^^ 



900 
TTGTXA 



1000 

CTOSCTCACATTTAATCrin?GATGAA^ 

1100 

CGCTGGGTCGGTTACGGCCAGGACAGTCGTT^ 

1Z00 

GTTGGA.GT GAC GGCAGTTAT CTGGAAGATCAGGAIAT GTCGCGGATCAGCGGCAXTT T CCGT GACGTCTC G^GCTGCTVIIAAACCGACTAjCACAAAXCAG 



1300 




1500 

CAC TAG GTCTGAACGT CGAAAACC C GAAJ^TGTGGAGO^CGAAAT CC CGAAT^'rt-'JJATC GTGC GGT GGTTGAACTGCACAC CGCC GAC GGCAC GCT GAT* 

1600 



CACGWKZAXCATCCTCTGCATGGTCAGGTIt^^ 



1700 



iaoo 

CGCATTAICCGAACCATCCGCTGT GGTTVCACG^ 

1900 

GAATCGTCTGACCGATGATCCGCGCTGGCX?J3C&GCGAXt^ 

2000 

TCGXTTGGGGAATGAATCAGGCCACGGCSCT^ATCACG 

2100 

GAGXICGACACCACGGCCACCGAZATIMTTGCCCGA^ 

2200 

GC T1TC <^C7Hh.C CTGGAGAGACGC GC CCGC T GATC Z TTT GC GAAT AC GC OCAC GCGATGGGXA^JIAGT CTTGGCGGTTTC GCT AAATAC T GGCAGGC GTT:T 



CGTCAGTATCCCCGTTTACAGGGCGGCTTCOTCTGGGAJST^ 

2 4 00. 

GTGAT T TT GGC GAIAC GCC GAACGATGGOCAG TTG TG^IAT GAAC GG-TCTGGT CTTTGC C GACCGCAC GCC GCAT C CAG CGC TGAC GGA : W5CAAA J _CAC CA 

" ■ ~ 2500 

G CAGCAG'nrT^TCCAG^rTCC G^TTTATCGGGGCAA^.CCATC GAA<5T GACCAGCCEAA31ACCTGTTCCGTCAT AGCGAXAACGAGCTCC TGCACTGGA2?G<3TG- 



I 2SO0 

gcgctggatggt^gccgctggcaagcggtgaagtgc^ 

GCGCCGGGCAACTCTGGCTC^AC^^ 

2800 

GGAAAACCTCAGTGTGACGCTCCCCGCCGCGTCC^ 

2900 

CAAS-r^ACCGCCAGTCAGCCTTTCT^ 

3000 

A1AACGACATTGGC GTAAGTGSAGC GACC C SCATTGACG^TAACGCCTGGGTCGAACGCTG€AAGGCGGC GG<^CATTACC>aS<^CGAAGC3\G<;gTTGa"a* 

3100 

GCAGTGCACGGCAjGAT^^ 

3200 

CG^AnrTGAXGG-l^CrGGTC^AAIGGCG^ 

3300 

AG«3G<^GAGCGX5GTAAACTGGC^CGGA 

.3400 



3S00 

TTCAACAJCAjG^Ct^TACAGTCAAC^ GGCTGAAIATCGACGGTTT CC 

3S00 

ATI^TC^GGATTGGTGC5CGAjCGAJCTCCTGSAjGC 

3700 

ATCCGTCGACTAAGGCCAAAGAGtTCT^J^^ 

3800 

AAT&TTTCGAT C GAAT AITTC C1AII1?VGAACAAC1AATAGXAXTAC CTAATTAC CAAGTCTT&ATTTAGCAAAAATGTTyLx' 1 l^Tll^AGAAAAAATJiaAXTAT 

3900 

TXATTTGAAAX TTAAA.GTCAACTTGTCATT^ 

4000 

TC 7 AT ATTAGT GAT GATTTC TTTAGCTAt=TAA3AC Al TT 1A ATTATA T JCG G C TTT G AXC ATT I T CTGATTTTTrCCGAACGGArTTTCGT?US\ircCTTT 

4X00 

C GATCTTCIATA^VTGSCT CAITTTAIT G^GGACGGTCAC GAGAG CTCCACTTTT GAAXTTCTGTTC GCAGACXC CGCA ^ 1 l^J_ ^CAC3iXA&C C GGGAC 

4200 

ATCCGGnTTTGWGAGATTT^CCAG^ 

4300 

CTAGCATTTACATAAGe^TCAl9<Z7^^ 

4 40 0 

GC GAACC GCATLACAAAGCTCTCGCCIGCAC^ 

4500 

C T CTTOCTGCACAXTT 7GCAGGAGTACGGC C TTT GAC C CGTG^GCAATCGCATGTGTC GC GC CAGA. 1 TO 1' I?C x GCGAAAtAAACTTCTTGGAGCAGATGC 

4S00 

GGGCG^CCGGGGTGG^CGAAGAACTCCl^GCATGAG^ 

4700 

AGAAGC-CG^SCGC^GGAAXCGT^AAT^ 

5300 

AGGCGATAGAAGGCGATGCGC*?GCGAJVTCG^ 

4900 

G GGrT^AGC CAAC GC TATGTC C TGATAGCGG T CC GC CACAC C CAC-C C GGCCACAGTC GATGAATC CAGAA^AGCGGCCA I L' V i' 1' C CAC CAT GAT AT T CGGCAA 

5000 

GCAGGCATCGCCATGaKSTCACGACGAGA^ 



TCCA^TCATCCITCMCGACAA^ 



<^ATC^CCGCCGCATTG_^^ 
^CAGTCCCTTCCCGCT^ 



T^T?CAGGGCACCGGACAGGTCGGT^ 



5<J0O 
GAXTGICTGTT 



^CCCAGTCATAGCCC^TAGOT^ 



5S00 
TCTC 



TTGAXCAGArCTTGATCCCCT^GCC^^ 



rTTCCCTTGTCCJU_vrAGCCCAG^ 



5600 

CCLAACCTTACCAGAGGGC GCCCCA 
5700 

TT TGCGCTTGCGT 
5800 

:gtottccgcttcctttagc\gccctt 



5900 

- GAAATCGGCAAAATC C 



C ' I ~^ T ^ >k ^' ^ ^~?^^ G ^ZLXG<1 C C GvUSAOlAGGC^TTGAGTGT TGTTC CAGTTTGG AAIlAAl^GT CCACTAT TAAAG AAC GT GGACX C CJUUT GTC^J^AOSGC G 



AAAAA.CCGTCTATCJU5G^GATGGCCGGATCACCX 



TAXG-GOTGTGAAAXACCGCACAG^ 



sioo 

XT 



cctcgctcactgactcgctx^goix:ggtc^^ 



S20O 

AGAAXeAGG45GAXAA 



64QQ 
:GCTCrcCTG 



TXCCGACCCTGCCGCTTACCGX5^ACCTGT CCGCCTTTCTCCCTTCGGGAAGCGTGGCGC T T JJl 1 _\XA< 



6500 

.GCTCAO^CTGTAGGTATCTC^GTTCGGTGTJ^ 



GGTCGTTCGCTCCAAGCTGGGOXGTGTGCACGAACCCCCCGTTCAGCCCGi^ 



GCTACACTMAAGGACAGTATTTGGTA^ 



TGGTAGCGCCGGl 1 i 1 1 I&TTTGCAAi3CAGCAGATTACGCGCAG^ 



6900 

CTTTTCTTACTGAACGGTGATCCCCX 



CCGGAi/TTC^GG^CGCGGAATTCTCA^ 

3XP3-EYFP MARKER > 

CTO^XCTAATTCAATTAGAGACTAA^ 

. MARKER " 



GGCGCTTTCGTCTACGGAGCGACAATTCAATTCAAACAAGCAAAGTGA 3 . 

— 3XP3-SV? P MARKER 



AACAAX C GGG^TAfXGCTAGAGTC GACGGT ACGAT C CACC GGTCGCCACCAXGGT GAGCAAGGGC GAGGAGCTGTT CAC C GGGGT GG" 



7300 

TGCCCATCCTGG.T 



" CGAGCTGGACGGCGACGTAAACGGCCA^ ^JTTCATCTGCAC- 
: _3X?2BYF? MARKER "> 



Fig.; /£(3) conh. 



750O 

AGCGGCAAGCTGCGCGTG^CCTrGGCCCACCCrGGTGACCACC^^ 

3X?3-rrr?3 MA3K3R 

tcttcaagtccgcc^tgcccgaagx^ 

; 3XP3-2YF? KA51X3R 

770O 

CGAC\C C C TCGTGAACCC<ATCCLAGCTGAAGGGCATCGACTTC\\G^ TACAACAGC CACAAC 
3ICP3 -=TYT? MARKER . 

7300 

GTCTATATCATGGCCGACAAGCAtfSAAGAACGGCATCAA^ 

3XP3-EYF? HA<*KER ^ 

7S00 

AGCAGAACACCCCCATCGGCGACGGCCCCGTGCTG^ 

: 3XP3-EYTP MARXSR > 

3000 

T CACATGGTCC TGCTGOAGTTCGTGACCGCCGCCGGGS TCACTCT CGGCATGGAC GftGCTGTS»CAAGTAAAGCGGCCGCGACTC ^AJSAXCA^AATCTagc C 
i ! : [ 3XP3-EYEP MARKER ^ > 

aioo 

ATACCA.CAT TT GTAGA GGT TTTAC TTGCTT TAAAAAACCTTCCC?UIACCT C CCCCT GAAC C TXGAAACAIIIAAAATGAATGCAArTGT T G TT GT TAAC T TGT T 
3XP3-BTEP MARKB R ' > 

92QO 

TATTGCAGCTTATAATGGT^C^ OAACTC 
„ 3XP3-SYFP MARKER ' >> 

8300 

AT CAAT GTATC TTAAACCTT AXCGATAC GC GTAC GGCAC T AGTGGATC CCAT GC GTCAAT TTTAC GCATG ACTAIX T TT AACGT AC GT CACAATAT GA T T 

< _I#EFT TERMINAL REPEAT 

3XP3-EyFP MARKER > - 



8 4O0 

ATCTTTCTAGGG^TAXTCTAGCTGCGTGTTCTGeA 
< 

9 50 0 

TCCTCCACGGGTT<ZAAAAACC^GAATGAA 

9600 

CCTCGCTG^GTAATATGCCLfrUSCGCCG^ 

8700 

CT^CGGGCTGACCATCCGGAACTCTGTCCGGAAAA^ 

3900 

AC^CCTTCCCGAATCATCATGGTAAACGTGCGTTTTCGtrT 

390Q 

GGGAAAAGGCACGGGCXTCrTTCCTTCCC^GATGCCCA 

C TAGAT T AAC C C TAGAAAGATAG^C TGCGTAAAAXTGAC GCAT GGG ATC C C C C GGGCTGCAGG AATTC GAT AICAr^GC T TATC G ATAC C G^ GAAGC T' 1 * 

< RIGHT TRPMIXAL REPEAT C SEQ ID "MO 
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pC?Z)-3ic-£CFP h~lf~ I nt ) Sccrua^c, Range: 1 co 0012 

loo 

^C^AG^\TAC\CTTAAA7TCAGTGOACGT^^ 

AAAAAAAAATGAAAT AT T GCAAATT TT GCT GCAAAGC T GT GAC TGGAGTAAAATT AATTCAC G T G C C GAAGT GT GC TATT^Ai^GAAAA J^TGT GGQAGC A 

G AGCG T/T G GGTGCAGC C TTGGTGAAAACT C CCAAAT TT GT GATAC CCACTTTiAAT GATT C GCAGrGGAAGGCTGCACC T GOAPJUKJTCAGACATTTA-XA. 

400 

AGGAGGCGACrCAACGC^GATGCCGO^CCTAGT^^ 

500 

C GTGACT GTGCGTXAGGT C C IX? TTCATTGT TTAATGAAAAXAAGAGC TT GAGG<^J3^i\AAA 

600 

TTCGTAC j. T i-J»<3U?TACGi\AAXGCGrCGTTTAGAGC^ 

700 



700 

ACTTAJVTCC^CTTGC\GCACATGCCCCTTTCGC^^ 

900 

GAATGGGGGTTTGCCrGGTTTCCGGCACCAGA 

. 90 0 

G<3C^3ATGCACGGTO^GATGCGCCCAXCTACACCL^ 

1000 

ctcgctc^aca^tttaatgttgatgaaagctggcta^ 



1XO0 



CG^TGGGTCGGTT^GGCOUSGAO^ 

1.20 0 

GTTG<GAGTGACGGCAGTXAXCTGGAi\GATtZAGGAXArGXG<SCGG 

1300 

CC^TTTCCATCTTGCCACTCGCTrCAAT^ 

1400 

A<CAGTTTCTTTATGGCaGGGTGAAAC£^^ 

1500 

CACTAC GTCTGAACGT CGAAAACCC GAAACTCSTGGAGCC^CGAAATCCCGAATCTCTAXC GTGCGGTGGTT^AACTGCACAC CG CC GAC GGCACGCTGAT 

_ 1S00 
TGAAGCAGAAGCCT^GAXGTCGGTTTCCGCGAGGTGCGGATTGAA^ 

1700 

CACGAGC^TCATCGTCTGCATGGTCAGGT^ 

j laoo 

C &CAXTATC CGAAC CLAT C CGCTGTG GTACACGCTGT GC SACCGCT AC GGC CTGTATGT GGTGGA1 GAAGC GAATAXTGAAAC CCACGGCATGGT GCCAA7 

1900 

GAATTOTCTGACCGAXGATCCGCGCTGGCTACCGGCGATG^GCGA 

2000 

TCGCTGGGGAATGAATCAGGCCACGGCGCTAATCACGA^ 

.2100 

GAGCCGAOACCLACGGCCACCGAXATTA^ 

GCTTTCGCTACCTGGi 

2300 

C GTCAG TAXC C C C GTT TACAGGG CGGC T T C GTCT GGGACT GGGTGGAT CAGTCGC T GATT AAATAT GATGA^^AC GGCAAC C C GTG GT C GGC TTAC GGC G 

2*00 

. GA2TTTT(^GAIACGCCG,AAC^ 

" 2S00 

GCAGCASTTXTTCC^GTTC C GTTXATCCGGGCAAAC CATCGAAG rGACCAGCGAAIAC CT GTTCCGTC?.TAGCGATAAC GAGCTCCTGCACTGGAX" GGT G 



2S0O 

GCGeTG<SATGGTAAGC CGCTGGCAAGCGGTGAAGTGCtrTCTGGAXGTCGCTCC^ rGAACTACCGCAGCCG^AGA. 



? 2700 
GCGCCGGGCAACXCTGCCTCAiTAjGTOCGG^ GGGCACATCAGCGCCTGGCAGCAGTGGCSTC TGGO 



2800 

GGAAAACC TCAGTGTGACGCTCCCC GC CGC GTCC CAC GCCAXCCGIGCATCTGACCACCAGC3AAAT GGJJT TTTTGCAiTCGAGCXGGGT>J\JAA.GC(2TTG<jr 



2900 

CAATTTAACCGCCAGTC^GGCl. L i I L_ACAGA.T GTGGATTGGC GAIAAAAAACAACT G CT GAC GCC GC TGC GCGATCAGTXCA£TC C G TGCAC CGCT 3 G 



3000 

ATAACGACAX^GGCGTAAGTGAAG^GACCCG 



3100 

GCAGT^SCACGGCAGA3LA£^ACTTGC^ 



3200 

C GGATT GAT GGTAGTGGTCAAAI G&C GAXTAC CGTTGAT G TT GAAGTGGC GAGC GAIIIACAjC C GCXT CCGGCGCG GATTGGC CT&AAC T GC.CL? jGCTTGGCGC 



* 3300 
AGGTAGCAGAGCGGGTAAACTGGCTCGGATT^GGGGCGCAAGAAAA CTGCCATTGT C 



3400 

AGACATGTATAGCCCGTACGTCTTCCCGAG^XAAAACG^TCTGC 



3500 

T TCNATATCAG^CGGTACAGTCAAC^^^ c 



3600 

ATAXGC-GGATTGGTGC^i^ACGAiZT^ 

. 3700 

ATCCOTCGACTAAGGCCS\AA&AGTCTAATTTTTOT 



3S00 

AAXGTTrCGATCGWU'ATTCCAIAGAJUZA^CAA^ 



3300 

rT7a7TTGAAikTTr?sAAGTCAACTTG 



1000 

TCXAX?VX^L^TGATGArTTCXTTAGCTAOT 



4100 

C GAXtTT CAXAA TGGCTCATT TTA2TT GCGAT G<^C GGTQ\GGAGAGCXC CACTTTTGAAAl'T '1X> 1 ' it TGCAGACAClCGCATTTGTAGCACATAGCCGGGAC 
AXCCGGT-rTGGGGAGATTTTCCAGTCTCTG^r^^ 

4300 

CTAGCAXTTACAXAAGXiAXCA^ 

4100 

GCGAACCGCJU^CAAAC^TCTCGCCGCACACT 

4500 

C X CTTX^TT^ACAT TTTGCAGGAGTAC CTTTGAC C CGTGTGCAAX C GCATGTGTCGCXK: CXGCXTGT TCT GC^SAAAOS\AAC TT CT T GG AGCAGATGO 

4600 

GGCCGCCCGGGG^GGGCGAAGAACTCCAGCATGAGAXCCCCGCG^ 

4700 

AGAAG GCG GCGGTG<SAA!rCG>AX3!C TCGTGATGGCAGGTTGGGCGTCGCT"©^ CGG^CATTTC GAACCCCAGAGTCCC CCT CAGAAGAAjCTTCGTCAAGA 

4600 

AGGGGATAGAAGGC GAT G C G CT GC G AAT C GGGAGC GGC C^JTACCGTAAAGCAC GAGGP AGC GGTCAGC C CATTC GC C GCCAAGC T C TTCAGCAATATCAC 

<?900 

GGGTAGCCAAC GCTAT GT CC TGATAGCGGT CC GCCACACC CAGCC GGCCACAGTC CATGAATCCAGAAAAGC G GC CAT T TTC CAC CATGATA07TCGGCAA 

"_"-.Zl~ . '. ' . Z'J^- " * £'rf' : " " * —..5000- " 

GCAGC-CATCGCCA^C^GTCIACCA^ 




Fl 0. /3(B) Corrh. 



GCGTATG<r?jGCCGCCGCATTGCATCAGCCA^ 



520O 



^CCCAGGAGCAAGGTG^^XGACAGGAGATCC^ 



"530 O 



rCATTCAGGGCACCGGA£^AG<^GGTCTTGAC^^ 



GTGCCCAG" 



~C^^CC^AZAGCCTCTC^^ 



TTGAIXIAG^rcTTGAXCCCCXG^GCCAXCAGAXCCTXGGCJ 



S60Q 

«SCAAGAAAGCCATCCAGrTZACn^GCA^ 

5700 



TTTCCCT 



TG7TCCAGM2AGCCGU30ajGCTG^CATTC^^ 



'GGCTTTCTAC GTGTTi 



5800 

CCGCTTCCTrTAGC?_GccCT'r 



GCCC(=CTGAGTGCTT&CGSCAGCGTGAAGC TAATTCMGGTTA2MATTTTT< 



'GTTAAAXlIAGCT(IATTTTTT3^ 



CTTATAAAI^AAAAGAAIAGCCC 



rTGGAACAAGA GTCCACTATT AAAGAACGT GGACTC CAAC GTCAAACiG-5CG 



AAAAACCGTCTAXCAGG<^GA^ 



S100 

«^CAGATGCGTAAGGAGAAAAXACCGCM 



6200 

^CGGTTAXCCXczAGlAATCAGGGGAIAA. 



C GOAGGAAAGAACATG TGAGCAAAAGGC CAGCAAAAGGCCAGGAACCGTAAAAAG GCCGCGTTGC TGGCGTTTTTCCAT J\X3GCTC C<^C CC CCT^ACG^G 



6400 



TTCCGACCCTGCCGOTTAC^GGAXA^^ 



G<5TCGTTCGOTCCAAG<ri^GGCTOTGax^ 



6700 



OACGACTXAOXGCCJ^TCGCAG^^ 



6800 

GGTMCTCTTGATCCGGCAAACAAACCACCG^ 



6900 



TGGTAT^ GGCGGTTTTTTGX TTGCAAGCAG4ZAGAT T AC GC GCAGAAAAAAAGGAT CTCAAjGAAGAJTC CTTTGAT CTTTTCTTAC TGAAC GGTGATCCCCA 



CCGGJVATTGCGGGCGGGGAAT^TXirrCATGTTTGA^ 

3XP3-3YFF MARKER > 



C GGGGATCTAA TTCAAT "TLAGAGAC TAATT C1AATTAGAGCT AATT CAATTAGGATC CAAGCTTAtfCGATTT CGAAC CCTC GACCGCCGC^ jGXA 



_3XF3-EY5T 



7100 
tSAAAXAGA 



i 72O0 
^ v= ^" TCGT C T AC GGAGC GACAAT TCAATT CAAACAAGCAAAGTGAACAC GTC G CTAAG CGAAAGCTAAGCAAATAAACAAGC GCAGC TGA\CAAGCXA 
_ 3XP3-5YFP MARKER > 



7300 



AACAAXCGGGGTACCGCTAGAGXCGACG^TACSAT CCACCGGTCGCCACCAXGGTGAOTL^GG^ t> CCTGGT 



3XP3-5YT? MARH3R 



C SAGC j. GGACGGCSACGTAAACGGC CACAAG^ TC\G<1GTGTCCGGCGAC-GGC GAGGGCGATGCCAC CI!? JC GGCAAGCT G AC C CT GA.- GT TCAT C TGCACC 



F~/0. /SCB) Corrh. 

ACC«^OT«CCGTGCCCTGGCCC^CCTCGT<^^ 

— 3x?3-sicr? matucer " 3 

1 — . 3XP3-2Y5"? MARKER > 

C ^CACCCT(~TGAACCGCAT 

— ■ 3XP3-SYF? MASKER " -^^CAAC 




TCACATGGTCCT^TGGAGTTCGT^ 

^2XP3-EVF? MARKER > 

^CCACAXTTGTAGAfi^^ 

: 3*p3-EYyp marker ; > 

TAXTGCAGCTTAIAATG<5rXACA^ 

■ 3XP3-SYFP MARKER . \ 

CAAX ST AXCTTAAAGCTT ATCSA.TAC GC GTAC GGGGCGCCTAGGC CGGC CG AX TGGAH C CCAT GC GTCAATT TT3V.C GCAT GATXPuTCTT TAACGXAC G 

< LEFT TERMINAL REPEAT 

3XP3-KYFP MARKER _> " ; 

GCGAGTCTGCCCGTCCTCCACGG^ITTCAAAAACGTGA^ 

GGTTGCTGAT^GCTTCGG^CTGACCATCC&GAACTGTGTTC 

^&TG<^GGCCA^CCTTCCCGAAX^ 

GCTTCAACCTCGCGGGAAAAGGCLACGGGCTTCTTCCTC^^ 

GAXCCXSAXGCAGCTJ^TTAACCCTAGAAAGA2^GTCT^ 

< RIGHT TERKTNAt, REPEAT 

CCGTCGAAGCTT ( SEQ ID NO I 49) 



p(PZ)-Bac-EGFP 



Hindlll 




Sail . 



FIG- 14 CA) 



) 



?2) -Sac-SGrP 

' Sequence ?.anga : 1 Co 9013 

ACC<5AAG1L\XACAC?TAAATTCAGTC3CACG?TTGCTTGTT 

SAGCCT7GGGrGC\GCCTTGGTGAAAACTCCC^ 
A^GAGGCGACTCAACGC^^ 

TT CGTAC TTTGGAGT ACGAAATGCG T CGTT TAGAGCAGCAGC C GAAT TCACTG GC CGT CG TTTTACAACG T C GT GACTGGGAAAAG C CTGGCGITTACCCA. 
ACTTAAXCGCCTTGCAGCACATCCCCCTTXCGCC^ 

gaj^ggcotttgcctggtttccggcaccagaagcggt^ 

GGCAGATGCACGGTTACGA^ 
CTCGCTCACAiTT^^GTTGATGAAAGC^^ 
CGCTGGGTCGGTTACG^CAi^CAOTCGTTTGCCGTCT 

GTTGGAGTGAC GGCAGTTAT CTGGAAGATCAGGATATGTGGCGGATGAGC GGCATTTTCC GTGACGTCTCGTTGCTGCATAAACCGACTACACAA?i^CAG 
CGATTTCCATGTXGCCACrrCOTnri^^^ 
ACAGTTTCTTTA^CHSCAGGGTGAAA^ 

C^CTACGTCTGAACGTCGAAAACCCGAAACT&TGGAGCG^^ 
TGAAGCAGAAGCCTGCC^GTCGGTTTCCGCG^ 
CAT:G^GCATCAJCCTCrGCATGGTCAGG^CAr^ 
CGCMTASCCGAACCATCCGCTGT^ 
GAATCGTCTGACCGATGATCCGCGG^GGC^ACCGGC 
TCGCTGGGGAATGAATCAGG^CACGGCG^ 

2 1 0 0 

GAGCCGACIACCACGGCCACCGAIA*^ 

SC T TT C GCTAC C T GGAGAGAC GC GC C C GCT G AT CCTTTGC G AAXAC GCCCACGCG AT GGGTAACAGTC XT GGC G GTTTC GC TAA : ^!ACTGGC.^ JG GC GTTT 

2300 

CGTCAGTATCCCCG^TTX?«CAGGGCGGCTTCGTCTGGGAJCTGGGTGGAlCAGT 

2400 

G - ^-^^"^GGO(^TAC GCCGAACGATCGCCAG7 TCT GTAT GAACGGTCTGGT CTT T GC CGACCGCACGCC GCAXC CAGC GCTGACGGAAGCAAA?XACCA 

~~ " 2*0 0 ' 

^^■^"^^^^^^"^GAGTTCCGTTTATCCGGGCAAACCATC GAAGT GACCAGCGAAT AC CTGTT CCGTCATAGCGATAACGAGCTC CTGCACTGGATGGTG 



F\Cr. WLk) Cent. 



GCSCTGGATGGTAAGGCGCTGGCAAJSCGGTGAAGTGG^ 



270O 

GCC^CGGGC^CTCTGGCTCAC^G^CGCGT^ 



GGAAiACCTC^GTGTCAC^TCCCCGCC^GTCCCACGCCA^ 



CAATTTA?\CCGCG\GTCAGGCTTTCTT^^ 



AIISVACGACATTGGCGTAAGXGAAGCGArc^ 



3100 

G^AGTGCACGGCAGSU^CACTTG^ 



3200 

C GGAIT GATGX^TAGT GGTCAAATGGCGATTAC CGTTGAT GTTGAAGTGGCGAGGGiADJIACC GCATCCGG^I GCGGATTGGCCTGAA/I T GX1CAGC rGGCGC 



3300 

AGGSMCAGAjGCGGGI^AACTGGCTCGGATTASGG^C^^ 



AGACATGTATACCCCGT^CGTCTTCCC 



3500 

TTCAACATCAG^CGCTACAGrC^CAGCX^CTQ^ 



3600 

ATLAJG^GGA^GOTG^GACGACTC 

3700 

ATCCGTCGA£TTAAi5<^CAAAGAGTCn!AAXTTTTGTTCAICA^^ 



3300 

AATG^TCG^CEAATAOrrCCLASSU^J^CA^ 



3S00 

TXAOTirrGiVAAXTnSVAAGTCAACT^ 



4000 

TCTATATTAGTGAT(^rrTCTTTAeC3MTAAa^ 



4100 

CGATCTCAXAATGGCT CAXT TTATTGCGAI GGJVCGGTCAGG3USAGCTCCACI 'IT 1 GAATTTCT GTTCGCAJGACACC GCATTTG'XAGCACATAGC C GGSVG 



92O0 

ATCCGGTTTGGG^3U3ATTTTCCAOTC^^ 



4300 

CTAGCATTTACATAAGGATOAGCAGC^ 



4400 

G^ZGAACCGCIACACAAAGCTCTCGCCGCACACTCCACAlCT 



4500 

CTCTT^TG^CATTTTGCAGGAJ^T^GG^ 



4o00 

GGCCGCCCGGG^GG^CGJtAGAAC^COVGC^GAGA^^ GGAAAACGATTCCGAAGCC CAACCYTTCAT ■ 



4700 

AGAA<=<^:^eGOTGGJVATCGAAATCTCG^ 



4300 

AG^GATAGAAGGCGATGCGCTGCGAATCGGGAGCGGCGA^^ 



4900 

G GG-TAGCCAAXl GCT AT G TC C TGATAGCGGT C C GCCACACC CAGC C GGC CACAGTC G ATGAATC CAGAAAAGC GGC CATTTTCCACCA7 G AT ATTCSGCAA 
GCAGGCAT C GC CATGGGTCACGACGAGAT C CTCGC CGTCGGG^IAT GCGCGC CTTGAGCGTGGCGAACAGTTCGGCTGGCGC G^.G^CC C!TG~AT GCTCTT CG 



I 



) 



Fig-. Mb) conf. 



rCGWJTCA7CCTGATCGAOJ\GA^ 

G CSTAT GCA^C C GC CGCATT GOAT CAGC CAT GAXGGAIAC TTTCT C GGCAGGAGCAAGG T GAGAT GACAGGAGAT CCTGC CC C GGCACT T CGCC CLAIAG 

"5-3 00 

CJU3CO-G0NCCCTTCCCGCTTC^^ 

T G\TTCAGG^ZACCGGACAGGTCGGTCCT TG^TT 

550O 

GTGC CCAgTCATAGC CGAATAGCCT CTCCAC CQLAGCGGC CGGAGA^(TTGCGTGCAATCCAX U I l^l 1 ^ CAATCAXGCGAAACGAXCCTCATCCTGTCTC 

5600 

CAXCAGATC TT GATC CC C TGCGC CATCAGATC CTXGG C GGCAAG AAAGC CATC CP.GTT rACTTTtEiCAGGGCTTC CCAAC CTTLAC CAGA-G<GGC GC C C CA 

S7oa 

GCrrGGCAATTCCQSTTCGCJTGCTGTCCAir^A^ TTCT C 1"! T GCGC ITCCGT 

5300 

TTTCCCTTGTCCAGATAGCCCAGTAGC:nG^ 

59O0 

GCGCCCTGAGrGCTTGCGGCAG<OTGAAGCTAVrTCATGGTO 

6000 

CTTAXAAATOVXAAGAATAfiCCCt^^ 

5100 



52 0 0 

C CTCGCTCACTGACTCGCTGCGC^GG^^ 
C&0\GGAAAGAACATOTG&GC3*AAA^ 

5400 

CATCACAAAAATCGACGCTGA?JSTCAGAGGTGGCG^^ CCCCTGGAAGC TCCCTCGTGCGCTCTCC T G 

ssoo 

TTCCGACCCTGCCGCTTACCGGATACCnrGTCCGCCTTTC 

5600 

GGTCOTTCGCTCCAAT^TGGGCTGTCT 

5700 

CACGACTTATC GCCAC TGGCAGCAG C CACT GGTAACAG<GATTAGCAGAfiC. GAEG'TATCIAJCGO^TGCTACAGAGTT C XT GAAGT GGT GGC CTAAC T AC G 

5900 

C^CTACACTACSUUXiACAGT^^ 

j 

S900 

T&ST1AGCGGC&GTTTTTTGTTTGCAAG 

7000 

GCGGAATTGCGGCCGCGGAATTCTCATGTTTGACAGCT^ 

3XP3-EYFP MARKSR > 

7100 

CGGGGATCT^ATTOATTAG?£AC T AATT CAAT TAG AGC TAAXTCAAXTAGGAT C CAAGC TTATC GATTTC SAACC CTCGACCGCC&GA QEAT A3 ATAGA 
. 3XP3-S^TP MARKER > 

7200 

GGCG^TTCGTCTACGGAGCGACAAT/TCA^TT^^ 

. 3XP3-5YT? HAKXER > 

7300 

AAXLAATCGGGGTACC GC TAGAGTCGACGGTAC CGC GGGCC CGGGATCCAC CGGTC GCC?£ CAI-GGTGAGCAAGGGCGAGGAGCT GT TCACC GGGGT GGT G 
[ 3X?3-3YT? MARKER > 

ft^- - - - .. .. - .JlILfVL 7400 

CCCATCCTGGTCGAGC TGGACGGCGACGTAAACGGCCACAAGTTC AGCG^G^CCGGCGAGGGCG^-GGGCGATGC CAC CTAC&GCAAGCT GAC C C TGAAGT 
3XP3-SYTP MAR3C3R > 



FlCr.. NCh) 



■ 7500 

TCATCTGCiXCCACCGGCAAGCTGGCCGTGCCCTG 

3XP3-EYFP MARKER > 

7500 

GC\GCACGACT TCTTC^jOTCCGCCAT 

3XP3— EYFP MARKER. ; > 

7700 

AAGTTCGAGGGCGACACCCTGC— TGAACCG 

_3XP3-~ ? MARKER ■ > 

7300 

AC^GCCAjCAACGTCILATArCATGG^ 

3XP3-5VTP MARKSR > 

7300 

CGACCJ^jCTLVICAGCAGAACACCCCCLATC^ 

3XP3-— ST" P MARKER > 

aoao - 

GAGAAGCGCGATOjCATGGTCCTGC TGGAG— TCGT GACCGCCGCCGGGAJTCACTCTCGGCATGGACGAGCTOT CTAGAT 
3XP3-SYTP MARKER ^ 

5100 

CAIAATCAGCCAI^CAO^TTG— AG^ 

. 3XP3-EY— P MARKER > 

3200 

GTTAACTTGTTTATTGCAGCTTA25U^ 

3XP3-EYFP MARKER > 

8300 

TGTCCAAACTCATCAATGTATCTTAAAGCTTATCGAX^ 

< LEFT! TERMINAL REPEAT 

. 3XP3-BYFP MARKER > " ' r 

8400 

GTCACAAICATGATTAXCTTTCTAGG<~rXAATCTAGCT 
< LEFT TERMINAL R2PEA 

3500 

CGCGiVSTCTGCCCGTCCTCC^£^3GGTTC^JU^ 

3600 

GCCAGG30ACCCGTCC - CGCTGACGT>ATMCC 

B700 

GGGTTGCTGATGCGCTTCGG^TGACCAITCCGGAACT 

8800 

AGGCGTGCATGGCCIAXIACCTTCCCGAAXCL^ 

8900 

C GOT^XlAACC^CGC^GAAAAGGCACGGGCTTCrTCCTCCCCGAX 

90 0 0 

TGATCCTGATGClAiSC _\GATTAACC GACi>CATGGGATCCCCCGGGCTGCAGGAATTCGATAXCAAGCTTAJCGAT 
< RIGHT TERMINAL REPEAT 



accgtcgaagctt ( SEQ IDNNO : 50) 



.pX.L-Bac-EYFP 



20 0 



Sad 

-s. / 

' Notl 
Xbal 

. ji .Sad 
BamHI 



3 



J<pnf 
-Sail 



HindlH 



°09Z 00VZ °° 



Pstl 



''19' Repeat 
Kpnfj 

Apal 
Xhol; 
Sail 
Hindlll 
Ecofll 
Pstl 
BamHI 
Spgl 



Hindlll 



Notl 
Xbal 



Pstl 



FIG. 15^/) 



FiO. '/5Q& 



pxi. -Ba c - syfp 



Sconce "S^jngc: 1 to 49 Si 



200 



C C G^C-AiTCAGGGCGXTGGC CCACTACGTGAACCATCAjCC C TAAt CAAGTTmTT I 



CCCCCGATTTAI»£CTTGXCG^G?JWCCCG^ 



CGOTGCGG^CTCTTCSCTA.rT? J CC^Ca J GCTGGCGAiWU^^ 




ATCCCAGCGC C GCAC CGCTGTCW Ti 



BOO 

! TGACCAT 



CCG<^CTGTGTCCG«AAAA^CGCGACG^CTGG^CCCA^SGCCT 




< WGHT TfiWCC^ REPEAT 



12QO 

GCGOTGGCGGCCGCTCI?jGAACTAGT 



— - __ MAK53ZR . 




G^CIGACCCrG^GTTCATCTGOXCACCC^^^ 

. 3KP3-BYTP MARKEB-_ * 



CTACCCCGAi 



rrCAAGTCCGCCAIGCCC 



5AGCGCGACACCCTGGTGW.CCGCXTC5ACC T OAAi&3<3<ZAT Ct=ACT1 



AGCT^GAGtXCS,AC IAC\AC\GC CACAAC GICT AST A 



jJxjes-aYrs MAaK2^ 



C3GCA£SCC5TC3CAGCTCGCC(SACCACTACCAG 



_3X3?3-SY?P MARKRK 



F,& /5m) 



CGTCGXTrG^TATG^Tr^ 

. _AKPCIUC.rN 3ESISTANC5 " " C 



TCCGXTCGTSGTCASAA^ 

. AMPC JXLXN RES ISTANC3 " "~ ' " "~ ~ 



_ AMPCXI.I.XN RESISTANCE 



470O 



-*?^^GAACTTTAAAAGTGC?CA?C^ , 
AK?CILLLM RESISTANCE __ 



TCGTG^CCCAACTGATCTTCAGCAXCTTTTACTTTCACCAG^ 

AKPCIU1EN RESISTANCE 



ACAC GGAAATGTTGAATACT CA3AC TCTTCCTTTn 
AMPCILLIN RES ISTAKC3 > 



AG AAAAAT AAACAAAT AGGG GTT C C GC GCACATTT C C CC GAAAASTGC CAC (SEQ ID NO" 51")t 



Ft 





GCGTCAATTTTAC GCATGAT TATCrTDiAC 
< LSST TEWKdAL REPEAT 



ATlt>TTAXCCGCTCACAATTCCACXCAAC\IACGAGCCGGr^k^^ GGGGTGCCTAATGA&T 



x;oi£i origin 



JOOO 

:ggccaacgcgcggggaga 




GGCGGTTrGCgrATTSGGCGC i'L I1CCGCTTCCTC 



G TT r tTCCAIAGOCTC CGCCCCCCT OPJZGf<GC^T^C^Z^^J^.T CSACGCT CAAGTCAGAGGTGGCGAAACCC 



320O 

:?AAAAAGGCCGCGTTGCTGGC 
330O 

TACCAGCCGTT T 
340O 



CCCCCTGGAAGCTCCCTciTrCKlGCTCTCCTOTTCC^CCCTGCCSCTTA^ 




GIA^rcrr^CCGGGV^AAC(^C5CTC^TAGC£GTCGTT??^ 
TTTGATCTTTTCr^GGGGTCTSACGCrCAGTSGXACGA^ 

"iQOO 

AMPCILLI2I SESISTANC3 > 

4 2 0 0 

AfrCFCXI^LZN RZ3I3TANCZ 



ec?»fAC CCACGCT CAC CGGCTCCAGATTTM!^ 



pXL-Bac-EGFP 




ftg. 16 (A) 



Sequence Ran 5 e: 1 co 49S2 



^^^^^^^ 



=GrGGACTCCX?jC GTCAAAGC-CCGAAAAA 




50O 



C GGTGCGGGC CTCTrTC<^TA.TTAX3SCC?ul 




30O 

:CACACCTTCCCGAATC 
100 0 



100 o 




_ ^ClTTACGCGTATCGATXXGCTrTAAeATXCAXT 



< . 3XP3-SGITP MARKER ^ L 



TCTCGTTGGGGrCTTTGCTC^&SSCGG? 

A<^U5CACSGGGCCGTCGCCGATGSGGGTGTTCTOT« 
< - . 33CP3-BGlTg 



1700 



^tGAlGCCGTTCTTCTGCTTGTCC^CATGAr^n^CGTTOTGGCTGrTGTAG^ 

C 3XP3-SGFP MAKKEP, ~~ 

GATGCCCrrO.(XTCG-GCGGTTCACCVGGGTGTCKCCTC 
41 — _3X?3-3GF? MARXSR 




FlCr. /6(B) Con+- 



GCTX^CTTGTGGCCGTTrXCGTCC^CSTCC^^ 
< - _3XP3-_.G_-_» MARKER 



2300 

rtSTTCJUSCrcXIGCTTGTTTATrrGCTTAGCT 
3jCP3-SGFP MARKER 



TTCAC_TT GCTTGTT-GJ^TC^ 

< - 3XP3-EGFP MARKER A 



2S0O 



A?_t_J_\__A_«C-A^^ 
< 3XP3-EGFP MARKER 



SXPS-EG - _» MARKER 



2600 



TGCGTCAATTTTXCGCATGAITATCTTXAACGTA^ 
< —EFT T—RMXNXL RE_?E?_C 



800 



C^TCTCCTCCATCACGCT — 



23QO 




AGGCSGTTrGCGTi\ITGGGCGCTCTTCC 



310O 

-CGTTCSG<_rGCGGC_&GCGG_A_^^ 

rnT.El aaiGru >, 



2200 



CGTTTT TCCAT A^SGCTCCGCCC CC C TCLA_GA^A_7CA-^_VA^ 

_. - COI_31 ORIGIN ^ 



3400 




_ . COU51 ORIGIN > 

3aoo 

SG_^GCTCTT_GA_CCGGCAAJ^\^^ 

COLEI ORIGIN ^ 

2 900 

C_"—_ATt:_T_— C_ACG<SG™ 

- C OLEl ORIGIN ; > 



4000 

_CO„l ORIGIN" | ' > 

.ZLi^. . - * . ~"_1 * . - .. . "I:, ___^k?CII__IN RESI3_?CNC3 5 



4100 

ATCTGTCIArTTCGTTCATCCAEAG'Z^^ 

AMPCILLIN RESISTANCE . > 

4200 

CGCGAGACCCACGCTCACCGG^TCCAGAITTA^ 

: AMPCILLXN RESISTANCE > 

130O 

C C^VSTCTATTAATTGTT^CGGGAAGCrAC^GTAA^^ rXGC SCAaCC I 1G1 1 GCCATTGCTACAgCCATC: C^GCPTCTCACgC 
i AJMPCrLLXN R2SI3TANC2 : . ' • • ^ 

440O 



450O 

C T CC GAT CGTT GTCAGAAGTAAGT TGGC CGCA^TGTTATCACTOXTGG^TTATGGCLT^GCACTGCAXAATTC TCrTACTGTOVTCCC^CCGTAACATGOTT 
AMPCILLIN RESISTANCE , ; > 

470O 

CAXAGCAGAAC 1 1 I^AAAA^TGCTCATCAITGGAAAACSTTCTT^ 

AHPCXLilN RESISTANCE • > 

_ 4800 

CTCGTGCACCC^ACTC^TCTTCAGC^ 

AMPCI1LIN RESISTANCE > 

4900 

GACACGGAAAT GTTGAA1S\CTCATACT^XT CCT"rTTTCAATAl^^ 

A WgCUXJK RESISTANCE > - "~ 

^EAGAAAAATAAJUCAAAfEAGG-GGTTCCGCGCACAXirTC CCCGAAAA^TGCCAC ( SEQ ID NO I 52) 



pXL-Bac-ECFP 



? -o 0 



Sad 

/ 

,% ^Xbal 
.Spel 
.3> Sacl 

- BamHl 



°o. 



0.Q 



92 OQVZ 



00 



3 stl 



'na/ -Bepoat 



Kpnl] 
Apa 
Xhof 
Sal! 
Hindlll 
EcoRI 
Pstl 
BamHl 



Hindlll 
Spel 



Notl 
Xbal 



vKpnl 
Sail 



Hlndl.ll 



FIG. 11^ A) 



FiCr. Mb) 



pI-rL-3e.c-ECFP 



Sequence Ran^e : 1 co 43 41 



AA*JTCLAJ\AAGAAXAG^CCGA^^ 



10O 
20C 



300 




CQSTGCC^^CTCTlCGCrATTACKCAGCTG<^eAMG«^GMOT 



700 




AAAGA.TAG-T CT GC G?E7*JWJ»lT TG?a3G<^T0^TCT;*A3rrAA^ 
< RIGET TEHMZHAi REPEAT 



1200 

fTGGCGGCCGCTCTAfiAACIAGT 



1400 

GAACCCTOGAXICGCCG<^GTAa^^^ir?aSAX^^C^TTC OTCXACGGAGCGACAAIT 0*i\ITCAAA^ TAAS 
_ 3XP3-Ecr? MARKER ; ^ 

MARKER 



1SOO 

CTGTTCACCGG^OTGGTGCCCAXC C TGGTC GAGC TGGACGGCGAC GT3u\ACGGCCACV^TTCAGCGTGT CCGGCG?JGGGC G^JS<GVSOG^TCC:C^CCTAC G 
. 2XP3-2CS-P XARKBR , > 



GCAAGCT GACCCTGAAGTTCArrc 



1700 

CCCGTGCCCTGGCCC\CCCTCG^GACCAT:CCrGACCTGGGGCGTGCACTGCtTCAGCCG 
3X23-ECS*P MARKSR ^ 



C TACCC C GAC CACAT GAAG CAGCAC GACTT CTTC\AGT CCGCCMGCCCGAAGGC 2 
3XP3-ECFP MARKSR 



LBOO 

3.C CATCXT crrCA.VGGACGACiSGCAA.CTAC 




AGCTGG^I3TACAACTACATC\GCC\C^ACGTCIArA 



3XP3-3C:PP MARSZ3. 



~"~ ' ' 2100 

CGGCXGCGTGCAGCTCGCCGACCAC TACCAGCAGAACACC CCCATCGGCGACGGCCCCGTGCTGC TGCCC&AC^AC CiCTACCT GAGC?.CCCAGTCCGCC 
MASKER ■> 



CTGAGC^AAGACCCC^J^GAGAAGCGCl^ 

. 3XB3-ECPP WABSTER . > 



ATGAArGOATTCTTGTTG" TAACT TGTTTATTGO 




TGCAirrro^TSTT^rTTGXCUIA^ 

3XP3-3CFP MAKK3R > 



*CAAGCTTATCGAIACCGTCGACCTCX=AG<^ 



T AC GCAT GAIT ATCT T TAAC GT^XIGTCACAATATGATTATCTTT C TAGGG TTAAT C TAGCTGCGT GTTCT<^CAGC GT GT C GA&CAT C T TCAT C T gctct ^ 
< LEFT TEKONAI REPEAT 



ACmAATTGCGTTGCGCTCACXGCCCCCTTTCCASTCG^^^ 



XolEl otigin 




AG^TCCt^CCCCCTGACGAGCMClACAAAAAI^^ 



j 4-00 

GCTCCCTCGTGCKTCTCCIGncXC^CCTGCCKTTACCGGM'ACCrOTCCGCCT^ 



3S00 

TAfiGTXTCTOXTTCGGTGTAGGTCGnCGCTCCAJ^TGGGCTGrcTG^ 



360Q 

CTTC^GTCCAACCCGGTAAGACACGACTT^TCG^ 



T GAAGTGGTG CC CTAACTACGGCTACAC liAGAAGGACAGTATT^GGTATC TGC GGT CTGCTGAAGC CAGTTACC TTCGGAAAAAGAGTT GGTAjGCTCT 



3700 



3900 



JO uu 

ATCC&^TAAACAi\ACCACCGCTGG:C\^ 



T CTACGGGGTC T( ^^GCTCAGTGGAACGAAAACTCAC GTTAAGGGATTTT GG/TCATGAX^TTATCAAAAAG<WkTCTTCACCT AGAJTCC TTTm^TCAAA 

4O00 

^TGAAOTTTTAAA^CJ^J^CTAAAGTAT^AZGA 

AMPCII^LIK AESIS-2AKCE > 



TCGTTCATCC\'E>J?rTGCCTC 



410O 

3 CC GT C GT GT AGAT AAC T AC GAUAC G<3£=AGGGCTT AC CAT CIGGCCC CAGT GC T GCAAT GATAC CGC GAGACC CA 
AKSCILIiIM ?^SI3TAWC3 > 



4200 

C 5CTCAC C GGC TC CAGATTT A7CAGCAATAAAC CAGCCAGG CGGAAGGGC C GAGC GCACAAGTGGTCCTGCAAC TTTAT CCGCC T C CATCCACTCSnk TTA 
• • ' - -A MPCIIiLiy ?^ S X 3 TANC3 - - . 



F7o. HC&) ao n ±. 



______ AMPCTU^XU RESI3TANCS "* " 



T^GOCTTCATTC^AGCTCCGGTTCCC^ rGTGca 
• AMFCXLI,IJJ R2SISTJUJCH 



a 400 



! ; AM?crLUU RESISTANCE > 

* . 

*600 

____ AMPCXXLITM R2SISTA3TCE . ■ > 

4700 

FTl^VAAAGTGCTOVrCArrGGAAAACGTrC*^^ 

. , AKPCXLriW RESISTXNCE ~ ^ 



AACTGAJXTTTCAGCajCTTl lA CT'I I CXCCAjSC GITT CT 



4300 



AMPCHLIN R > -™cw^_s«v 



^ (SEQ ID NO: 53) 



pBS-iTR-ECFP 



Kpnl 




Sad 
BamHI 
Hrndlll 



F3S-ITR-SC7P , (U . , w - - ^ Sequence Rang* : 1 to 4943 



10Q 



CACCTGAC G CGC C C T GTAGC GGC GGATT AAGC GC GGC GG GT GT GGTGGTT AC GC GCAGCG T GACC GC TACAOT T GCCAGC GC C C 7AGC GC C CGC T CCTTT 1 

CKTTTCrTCCCrTCCTTTCTC^^^ 

CTCGACCCCAAAAAACTTGATTAGGG^ 

ATAGfTGt^CTCTTGTTCCAAACTGGAACAACACTCA 

'VU^GAGCTGATTTA^CAAAAATTfT^^ 

GATCGOTGCGX^CTCTTCG^XATTACGCCA 

T TGTI^AAAC GACGGCCAGT GAATT G T AAT AC GACT CACTAT AGGGCGX^TTGGGTj^ CGGGC C CC CCC TC GAGGTCGAC GGTAXC GATAAGCTT GA^EAT C 
GAATT C CTGCAGCC C GGGGG ATC C CAJGO GT<ZAAX TTTAC GCAGACllArC T TTC T AGGGT TAATC TAG C T GCATCAG3ATCAT ATCG "TC GGCSTC TTTTT T 



_RIGHT TilKKXHAi R3FBAT 



900 

CCG^TCAGTCATCGCCC=\AGC7re^CGCTArC^ 

GAGGATGAGTGCTGCllGCAarTGACGrTGAGCGAAAACGGACGTT^ 

TTCAGGCCACCTGGGATACCAGl^COTCC^G^ 

CCGGAArrTGCCGTGCCGGTGrG^^ArT^ 

1300 

AC^TGGATACCCCGTGAOTCACCCGHSCGGCTCGCT 

1400 

^^^AGCAGATGAAGATGCTCGAC^ 

I»EFT TEBMINAI, REPEAT ^ 

1300 

AAATTG^GCATGGGA^CACT!AGTGTTCCCACAATGG.TT 



_3XP3-SCFP MARKER 



1700 

AA<^\CGTCGCTAa^GAAAGCTAAGG^ 

1 ] 3XB3-KCFP MARFSR ■ > 

laoo 

CACCAT GGTGAGCAAGGGCGAGGAGCTGTTCACC GGGGTGGTGCCC^TCCTGGTCGAjGCT C^AGGGCGACGTA^ACGGCCACAAGTTCAGCGTGTC CGGG 
3XP3-ECF? MAAXKR > 

1900 

GA£GGCGAGGGCGATGCCACCTACG<3GW3CT^ 

, 3:t?3-2CFP bCkSXSR > 

2000 

C CTGGGGC G^GCAGTGCTTC AGCCGCTAC C CCGAC CACAT GAAGCAGCAC GACTT C T TCAA&T CCSCCAT GCCC GAAGGCT AC GT C CAGGAGC GCAC CAT 
3XP3-2CFP MARKER v> 

2100 

CTTCTT CAAGGAC GAC GSCAAC TACAAGAC C CGCGCC GAGGTGAAGTTGGAGGGG GACAC C CTGGTG^ACCGGATCGAGCTGAAGGGGAXCG\C 7T CAAG 
- ' - • -- 3XP3-5CFP MARKZR."!^- • • ..." ^ 



fiOr. Mb) cW. 



&AGGACG<K^ACAXCCTGGGGCA^^ 

. ; 3XP3-3C?p MASiCES. " ^ ^ ' ' 

TC^^CCTCCACLAACXTCGAG^ 

3XP3-SC5T MARXSR _____ > 

CCACTACCTGAC^CCAGrCCGCC^^ 

. 3XP3-SCF? MARK2R_ > 

G<*^GGACGAGCTGXACAAGTAAAGCG^ 



CCTCCCCCTGXACCTGAAACAE 2 ^^ 

3XP3-EC3TP MARKER > 

rcCACAAAIAAAGCAXTTTT--C^ 

. 3XP3-5CFP MARXSR _ ^ > 

CCGGCCGATACTAGTTCE^^ 

TAGCTGTTTCCTOTOTGAAArrrtrTTJ^ 

AATrrCACATTAAXTGC GTTG C GCTCACTGC CC GCTTT?CC?U^TCGGGAAAC:CTGTC GTGC CAGCTGCATTAAT GAATCGGC CAACGCGCGGGGAGAGGC^G 

>Colsl_oxig_n 
I 

TTT!GCGTATTGGGCGCTCTTCCGCTr^^ 
TCC^AC^TCCGCCCCCCTG^ 

rGGAA£KrrCCCtCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCC^^ 

3500 

CGCTGTAGGTATCTC^GTTCGGTC-AG^ 
ATCSTCTTGAGTCCAACCCGG^A^ 

3300 

TC^GATCCGGCAAACAAACCATICGCT 

TCTTTTCTACGGGGTCTG^^GCTCAGTGGAACGAAAACrrc^^ 

4000 

?TAA*AAXGAAGTT^AAATC^TCTA?A£^^ 

; AJiPCILilJI RES ISTAHC2 > 

aioo 

C ~ ATTT CG7 T C ATC CAT AGT TGCC TGAC C CC GTP CGTG3L\GATAACTAC GA2IAC GGGs\GGGCT"tACCAT CTG^SCCCCAGTGC?GCAAJGA!EAC CC^GA G 
. AMPCI2.IXN ISTANC2 > 

4200 

ACCCAC GCTCACCGGCXCCAGATT TATCAGCAAXAAAC CAGCC^G.CC£GA2>GGGCC,3A^ - . 
AKPCXXLZN HSSXSTANCS ~ 7~ 



F/&. JSCs) cbrff. 



_AKPClLLrN RESISTANCE ' ^ 

^^ATGCCTTCATTCAGCTC^ 

. AMPCILim RESISTANCE ~ ^ 



AMECILIiIK RESISTANCE * " > 

C^CTC^TGAGTACTCAACCAAGTCATTCTGAGAATA&TCTATGCG 

AM3>CXiZ.XN- RESISTANCE 

A KPCTUON RSS I5XANCK _ ~ > 

CACCCAACTGATCTTejU^TCT^ 

— ___AMpcruLiN assisTANCs ' ^ 



aatxaacaaatajgg?^ttccg<^^ ( SEQ ID NO : 54 ) 



PSS-ITR-ZGF? 



fie.. /9(b) 



Scqxrcrscc Sisge : 1 to *9^4 



CACCTGACC^CCTGT^ 

CGCTTTCTTCCCTrCCTTTCTCGCC-^CGTTCGCCGGCTTTCCCCGTCAAGCrcrAAATC 
C ^^CCCCAAAAAACTTGim^^ 

ATAGTCGACTCTTGTTCCAAACTGGAACA^CTCAACC 
AAATGAGCTGArrTAACAA^WTtAACG^ 



GATCGGTGCGGGCCTCTTCGCai^T 



^CGGCCAGTGAATTG^AAO^CGACTCACT^ 



GAATTC C TGCJLGC C C GGGGGAT C C CATGC GT CAA3? TT TAG GCAG^i 



300 

RIGHT TERMINAL REPEAT > i "* T 



CCGGCTCAGTCATCGCCCAAGCra^^ 



GAGGAXGACTGCTGCTGCATTGACGTTGAGCGA^AACGCACGTTTAC 



XTCAjSSCC^CCTGGGAS^CAGTTCGTC 



1000 

GGXTTTTAACGGTGaAjCTGTTCG- 



1100 



CCGGAACTGCCGTGCCG^TGTGC^^^ 



ACATGGAIACCCGISTSftGTTACrCS^GGC Tl^ 1 11TJ\T-PCACGTT r iU T ^ AIICCGTGXSU5g^GGGCasa.CTCGC 



1300 



1400 



TGGAGCAGATGAAX^GCTCSACA^ 

. LEFT TERMTNAL REPEAT > 

AAA-H^CGCAXGGGATCC 

: 3XP3-EGFP MARKER ^ 




1800 



CCACCGGTCGCCACCATGGTGAGC&>JSGSCGAGGA 

■ . 3XP3-EGTP ^ 



GCGTGTCCGGC GAGGGCGAGGGCGATGCCACCXAC GGCAAXSCTGACC CTGAAGTT CATCT GGACCAC CSGCJ\AGCTGOC CGTGCCCTC-GCCC? CCCTCGT 
_3XP3^3Grp MAP.K5R , ~ > 

^^^ c -^'- < -^ ^ - AC GGCGTGCAGTGC TTCAGC C GC TACCCCC?^CC?XLAXGA^C^GO.C GACT TCXTC AAG T CCGCCATGCC CGAAGGC TAC GrCGAG 
_ _ 3XP3-£GFT> MARKER 



G A&C GCAC CAT C TTC T T CAAGGAC G AC GGCAACT ACAAGAC C C GC GC C GAGGT GAAGTT C GAGGGC GACAC C C T GGTGAAC C GCA2 C C^GC TGAAGGGCA 

lilz: •" ' jjCP3-bgpp >ggik^:r -- --- • • . . 



FlQr. /9(B) CCA+. 



TCG^TTCWG^^^GGCAACATCCT^^ 



ca&ggt(3aacttcaagatccg^ 

3XP3-SGF? MARKER ' ^ 



ctgcccgac^accactacctgagcagicag: 




25O0 



GGATa\CTCTCGGCATG<^CGAGCTGT?UIAJ\G^ 



_2XP3-EGFB MARKER 



AACCTCCCACXCCTCCC^CTG 



GCATCACAAATTTCACAA^TAAAGCATT 

3XP3-BGFP HARXER ; ; * ^ 

230O 

^TGiAjSGGTXAATTTCSAGCTT<5<5CGTAAICAXt3<3TC 

2900 

A^£CTGrTTCCTGTGTGAAAXTGTTATCCG<rrC^ 

3000 

TAACTC^CATEAA3TGCGTTGCGCT^CIGCCCGCTrrCCAGTCGGG?l^ 

>CoX EX_ca jr igJLa 
I 

1 3100 
&rTTX3CGTATTGGGCGCTCTTCCGCTTCCTCGPTCAC^^ 



3200 




3400 

CTG<iAAGCTCCCTCGTGCGCTCTCCTGTTCCGAC^ 



3500 

ACGCTG1^i5<7rATCTCAGVITCGOTGTAeOT 

360O 

^^GTCTTGAGTCCAACCCGGTAAGACACGAC?^ 

3700 

AGTTCTTGAAGTGGTGGCCTAACTACGGCrACACIL^^ 

3800 

CTCTTGATC^C^^AAACAAACCACCGCTGGT^^ 



.3300 

ATCTTTTTCT^JCG<^GTCTGACGCTCJ^ 

4000 

ATTAAAAATGAAGTTTTAAATCAAXCTAAAG^ 

AWPCILI.IN R£SISTA*TC3_ :> 

4100 

- c cat t 7 c g t? cat c cat ag tt gc c tgac tccccgtcgtg tagataac 1acgatac gggagggc t tagcatctggccc cagt gc tgcaatgatac c gc ga 
; ; ampcixlIiIn resistance > 



GACCCACGCTCACCGGCTCC?u^^ 

AMPCILLIN RS3T5XANC3 > 



FiG-. HCB) co^rf. 



CTATTAATTGTTCKTCGCX^GCT^GA^ 

^ , AHPCILLIK RESISTANCE . . 



^TTTC^TAXGGCTTCAXTCAGCT^^ 

- ; A MPCTT.T.TN AKS 1 3 TANC3 *" ^ 

ATCGrTGT CAGAA£jTAAGTTGGCCG<I?^rrGTXATCAC rCATGGTTATGGCAGCAC TGCATAATTC rCTTACTGXCAT GC CA.TCC Gf£L*J»iG^VJGCT ' T, T^C TG- 
A MPCTI,T>-m RESISTANCE ~ > 

__AMFCELLIN RESISTANCE^ ^ > 

GAGAAlx^j;>kAAAGTO=T^ 

: . . ' . AMPCn.T.TN RSSl3rAHC3 4 . • ( . ■ • 1 . > 

4800 

G<^CCAACT<^CTTCAGCAXCriT 

j AMPCnXLO? RESISTANCE [ > 

. A KPCILLIN RESISTANCE > M 

AAATAAACAAATAGG<5G^CCGCGCACATTrCCCC GAAAAGTGC ( SEQ ID NO 55) 




FIG. 20 (A^> 



v) 



DB3-ITR-EY?? 

ScqMcncs Range: 1 to 43*\% 
^CCTGACGCGCCCTGTLXGCGC^GCAX^^ 
CKrTTCTTCCCrTCCTTTCTCC^^^ 
CTCG^CCCAAAAAACTTGATlMGGreATGGTTC^ 

AAATGAGCXGATXTAACAAAAArTTAACGCGAATTX^^ 
<^CGGTGCGGGCCriX:rTCGClV^^ 

T ^ rG ^ lt ^^ AC ^- C CAGXGAAXT G TAAT AC GAC X CACX AXAGGG C GAATTGGGT ACCGGGCCCC CC CTCGAGG TCGAC SG'TAT CGAXAAGC IT G?K.TAT C 



aoo 

CGGGTCTTTTXT 



90 0 



CCGGCTCAGTCXATCGCCCAAG^ 

CJAGGATGACTGCTGCTGCAITGACGTTGAGCGAAAACGCAC^ 
rXCAGGCCACCTGG^.XACCAGXXCGXCG^ 
CCGGAACXGCCGTGCCGGTGTG<iaGAXT^TGACAGCG^ 
AC^TGGA£IL\CCCCGTGAGTTACCCGGCGGCTCGTTGAXrCACGTTT^ 



1300 
GCGTGA 



TGGAlSC^ATGAASATGCTCG^ 

L3FT TEFMINAI. RKPSAX > 



AAATXGACGCMG<S<^XCCACTAG^^ 

3XP3-SXFP KARKER 



ATTAGGATCCAAGCTXATCGATTTCGAACCCrrCG^ 

. _3X?3-SY7? MARKER 



AACACGTCGCTAAG^GAAAGCTAAGCAAASZA^ 



1700 
:ggtcgc 

_3XP3-£YFP MARKER _ > 

1BQ0 

( ^ CC AXGGTGAGCAAGGGCGAGGAGCTGTTCACCGGGGTGGTG 

, 3XP3 - ZY5~? MARKZR > 

1900 

GAGG<^GAGGG^GMGCCACCTACGG<^AGC^ 

. 3XP3-SYX1? HARXSR > 

2Q00 

GCTACGGCCTGCL\GTGC?TCGCCCGCTACC^ 

__ 3X_?3-_T_T? MARKSR > 

2100 

CTXCXTCLAAGGACGACGGCAACll^CAAGACCCGCGC^ 

— ■ - -3X#3-3Y?P- "MARKER ------ ■ - ■ Z^j-Z > 



j 



f/c-- '30(B) ccn-f. 



2200 

GAGG AC GG CAACAT CC T G-G^G<S<GVCAAGCT GGAGTACAAC T ACAACAGCCACAACG TC TAT ATCAT GGC C GACAAGCAGAAJGAAC GGCAT CA AGGT<GAAC T 
3XP3-3YFP MARXSR ^ 

"230O 

TCAAGATCCGC CACAACATQ5AGGACGGCASCGTGCAGCT CGCCGA^CCACX^CCAGCAGAA.CAOICCC^XC&GCGACGGC CCCGTGCTGC TGOC CGACAA 
3XP3-E*7P MASKER ^ > 

2400 

CCXCTACCTGAGCTACCA^CCGCCCTGA&CAA^ 

_3XF3-SYFP MARKER ; > 

2500 

GGCAT GGAC GAGG T GTACAiAGTAAAjGCGGC CGC G AC TCTAG AICAlLXATCAi3CCATACCA^\TTTG TAGAGG XT TTACXT GCXT TAAAAAACCT CC CACA 
3XP3-ZYFP MASKER > 

2600 

CCTCCCCCTGAACCTeAAACATAAA^ 

MARKER ' > 

2-700 

XTCACAAAXAAAGCATTrcm^ 

■ MARKER ^ 



2300 

CCGGCCGAX , CAJCCC^7XrCG^UiAGC^G^ 

> * 

2300 

ATAGCTGTTTCCTGTGTGAAAXTG^ 

30 0 0 

TAACTCACATTAATT^CGTTGCGCTC^ 

! 

GTTTGC GXATT GGGCGCTCTTCCGCTT C CT C GCTCACTGA£rTCG<rTGCG<^CGGTCGTTCGGCTGCGGC 

3200 

ACGGTTATCCACA^GAATCAGGG<»^JkC^ 

3300 

TTCCATAGGCTTCCGCCCCCCTGACGAGCATC^^ 

3400 

CTG<^A.C^TCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCG^ 

3500 

ACGCTGT7AG0TA^CTCAGTTO3GTGHI3^GTCGTTC(^ 

t 3600 
TATCGT CTTGAGT CCAAC C CGGTAAGACAC GACTT ATC GC CACTGGCAGCAGC CACTGGT AACAjGGATTAi3CA&AGC GA.&3TAX G T AGGC GGTGCTAGLAG 

3700 

AGT^CTTCAAGTGOTGGCCTAAClLXCGGCrACACXAGA^ 

3300 

CTCTTGAI CC GGCAAAjCAAACCAC c gg t gg tagc ggt ggt tttttt GTTT GCAAGCAGCAGAX TLVTGCG CAGAAAAAAAGG atc tcaagaagat cctttg 

3900 

ATCTTTTCTAC GGGGT CTGAC G CT CAGTGGAAC G AAAACT CACTtTITAAGG^GAX m GGTCATGAGATTAT CAAAAAGGATCTTCAjCCTAGATCCTT TTAA 

Aooo 

AXTAAAAATG AAGT TT T AAAT CAAT CT3LAA&TATATAJ GAGTAAACTT&GT CTGACAGTTACCAAcXGCTTAATCAGTGA.GGCAC CTATCTCAGC GAXCTG 

AafPCIIim RESX5TANCE > 

4100 

XCTATTTCGTT CAT C CAT AG TT C TGAC T C C C C GTC &X G XAGAOGVACTAC GAZ AC GGGAGGG CG TACCAT CTGGCC CCAST GCT GCAATGA5CAC C GOGA 
. AH3PCILLI1T RESISTANCE > 



— ■ " — " • — ■ • • - - • 4200 

GACCCA.CSCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCA^ 

AMPCIiLZN RESISTANCE > 



FlO: <30(B) coni. 



43QQ 

CTATTA^TGTTGCCGGGAAGCTAG-^^ 

, AMPCIL-IJT RESISTANCE ^ > 

440(3 

GTTTGG_^GGCTra\XTCAGCTCCGOTTCC_VACGATCAA^ 

AHPCIIXIN RESISTANCE " > 

aTcgttgtcagaagtaagttggccgc^ rrrcTc. 

^AKPCIXLIN RSSI5TAWCE_ " > 

T<S\CT3GTSAS_?LCTCAACCA3^ 

. AM3?CXI»_IN RESISTANCE ' > 

'; ' ■ * ! - * ^700 

CAG^CTTTAAAA~gCTCATCA!ra 



> 



4300 

GCAC_^AACTGAIOTTC_\GC^^ 

AH_?CXL_IN RSSISTANC B ' > 

4300 

GGA^^GTrTGAATACTCATACTCT^ 

AMPCXiLIN ASSISTANCE > - 

AAATS-AAiCAAAIJ^ ( SEQ ID NO I 56) 



pBSll-Act5c-orf 




FIG. 21 (A) 



pBS II-Acuic-orf 



Sequence Range: I co 7411 



100 

C TAAAT TG^XAAGC GTTAA^TTTTGTT1AAA>.TT CGC GTTAAATTTTTGTTAAAT CAGCTCA T 'iTTT TAAC CAAJAGGCC GAAAT CGGCAAAATC CCTTAT 

200 

AAATCAAAAGAATAGAC CGAGAXAG GGTTG AGT GT TGTTC CAGT T T GGAACAAGAGTCCACTATT AAAGAAC GT G<5^CT CCAACGTCAAAG33C GAAAAA 

300 

400 

^ CCCCGATTTAGAGCTTGAC GGGGAAAGCC GGC GAACGTG€CGAJSAA?lGGAAGGGAAGAAAGC GAAA 1 )GGAGCGG<^CKITiU3<3<KZGCTG<SCAAGT CEAGCG- 

Soa 

GTC\CGCTGCGCGTAACCACC^CACCCGCCGCGCTTAA^ 

600 

cggtgc gggoctcttcgciattacgccagctgx^gaaa^ 

700 

TAAAACGACGC^X^JSTGAGCGCGCGTA^ 

aoo 

CAATTCTAAAAAAAAXCATGAATGGCATCJVVCXC 

ACT IN SC PROMOTER ■> 

900 

CTGCCTGTTATGTl£G^CC^jCAAACCAAGAC^ 

ACTIN 5C PROMOTER > 

1000 

AAagaagcgatcaaagaagcgtcttcgggcgagtaggagaaxg^ 

[ actxn sc pkmoter > 

1100 

ccaactg<k:catattcggagcaaxttgaa^ 

act in sc promoter > 

1200 

AAAGAC TC GTC GGTAAGATAAGAXT GTGT CACTCGCTGTCT C TCTTCATTTGT C GAAGAATGCTGAGGAATTTC C^GATGAC GT C GGCGACTATTT TGAA 
. ACTXN SC PROMOTER > 

1300 

GAArGAGAATAATTTGTATTTATACGAAAATC^AGTTAGTGGAAT^ 

^ . ACTIN 5C PROMOTER - > 

1400 

ACAAAGATTGTATG TATATAC C TT TAAT GT ATT CT CATTT TCTTATGTAT TTATAATGGCAAT GATGAT>GTGATGATATTTTZAAG?VTG : iVT GC CAGACCA 
ACTIN 5C PROMOTER > 

1SO0 

CAGGCT GATTT CTGCGTCTTTTGCC GAAC G<ZA£T GCATGT GC TG ITi'G 1 TT 1 TTGGAATAGTTTCAATTrTCGGACTGTCCGCT^ 

ACTXN SC PROMOTER > 

i " 

1600 

T!H5GCTTATTCAAAAAGCAAAGTAAAG^ 

; , ; ACTXN SC PROMOTER > 

1700 

GAAGGGTGGGGCGGGGCGTGGCGGGGTCTGTGGCTGGGCGCGACGT^ 

ACTXN 5C PROMOTER > 

' 1300 

TCTCC<:CACTCGCCeGTTGTTTTTTTCTTTTTAXGT^ 

ACT IN. 5C PROMOTER : > 

1900 

G TCATT TTG GC T C G AAAX AG GCAATTATTT AAACAAAGAX TAGT C AAC ^AAAAC GC TAAA^^XAAAT AAGT C T ACAAXAT GGT TAC T TAT TGC CATGT GT G 
_ ACTTN SC PROMOTER > 

2000 

TGCAGCCAACGAXAGCAACAAAAGCAACAAJ^^ 

ACTIN'SC PROMOTE " ;_- " > 



Fib: £(CEbc*& 



2100 

CAATTAC GC T GACA^AGAGCAGAC GAAGTTT TGGC C GAAAAACAT CAAGGC GCC TGATAC GAATGCATTT GCAATAAGAA7T GC G ATATTTAAT ATT GTT 



ACTTN SC PROMOTER > 



-2200 

TATGA\GCTGTTTGACTTCAAAACACA^ 



ACTIN 5C PRGMDT3R _ > 



230O 

ACAX:CGACACGAAATAG<rrTAjCCTGACGTCA^^ 

ACTIN SC PROMOTER _ > 

240O 

GCSTAGTAGGCATG^GGTTGTGTGTGAGGCGTCTCTCTCTTCGTCT^ 

ACT IN 5C PRGMOTER > 

250O 

TGAATGAATATTTt*CACTATTACTATTCAAAACT^ 

_ACTIN SC PROMOTHR . > 

2500 

ATTT GAGCAAAAATGC TGCATATTTAGGACGGCAT CAT*IL^\GAAAXGCTTCTTGC TGTGTACTT 'rTCTCTCGT CTGGCAGGTGTTTCGCCG^rX^TTGa^r 
ACTIN SC PROMOTER ; ■ , ~ " > 

2700 

AAAACC GGCTTAAGTTAGGT GTGTT1 TL LACGACTAGTGATGC CC CT?jCTAGAAGA1 fcrrGT G'lTGCACAAAT t^i'<_l_ C'IKSAATAACCAATTTGAAGT GCAG 
_____ ACTIN SC PHOMOTRR " > 

2300 

ATAGCAGTAAACGTAAGCTAAIATGAATATTATTTAA 

ACTIN SC PROMOTER > 

2900 

ATG T T TT GGCATACAAT GAG-TAGTT GGGGAAAAAAT GT GT AAAAGCACC GTGACCAXCACAGCAT AAAGATAAC CAGCT GAAGT AIC GAAT AT GAGTAAC 
; _ACTIK SC PROMOTSR > 

3000 

ccccaaattgaatcacazgccxscaactgaj:^^ 

ACT IN SC PROMOTER [ > 

>CCATAIAXGG element 
I 

I 3100 
GGAAAT TCT CC GTAAA TGAAAACC CAAT C GGCGAJUZAATT CAIAC CCAIATATGGTAAAAGTTTT GAAC GC GA.CTTGAG AGC GGAG^iSCAT'TGCGGCTGA 
A CTXM" 5C PBOWDTHR > 

>TATA-box 
I 

I 3200 
lAAGGTTTTAGCGCTAAGCG^GCTTTAT^^ 

IFF2 ORF SAMKI CARTRIDGE: > 

ACT IN SC PRbrfOTER > ' 

j 

3300 

ATAICCTCTCTGCTCTTCTGCAAAGCGATGAC^ 

• : IP? 2 ORF RAMHl" CARTRIDGS > 

3400 

T ACAGAAGAAGCGTTTATAGAT GAG GTXCAT GAAGTGCAGC CAAC GTCAAGCGGTAGTGAAATAXTAGAC GAACAAAAXGTTATTGAAjCAACCAGGTTCT 
IF? 2 ORF aAMHI CARTRIDGE > 

3500 

TCATTGGCTTCTAACAGAATCTTt^CCT^ 

IFP2 ORF 3AWKI CARTRIDGE > 

3S00 

T C T C T GCAC TGAACATTGT CAGATC TCAAAGAGGT C C GAC GC GTAT GT GC C GCAATATATATGAC C CACT TTTATGC TT CAAAC TL^TTTTTTAC 7 GAT G A 
; IF? 2 ORF BAWSX CARTRIDGE > 

3700 

GATAAT TTC GC^A^T GTAA^ATGG ACAAATGCTG AC^TATCATT GAAAC GT CGG GAAT C TAT GACAGGT GC TACATTTCGTGACACGAAT GAAGATG^A 
" " " "\- IT? 2 ORF SAttEI CARTRIDGE-..' ; ' ' .-- ■ ■ > 



Pi Or: e?/C$)- coot. 



380O 

ATCZLATGCTTTCTTTGGTATTCTGGTiAATGAI—^ 

rr?2 orf a^KHi cartridge m > 

3900 

T C TCP G TAAT GAGTCGT GAT C GTTT T GAT TTT T TGATACGAT GTC TTAGAATGGATG_\CAAAAGT ATJU_ GGC C CACACTTC GAGAAAAC GATGTAT TTAC 

IP? 2 ORF BAMHI CARTRIDGE > 

400O 

TCCTGT TAGAAAAATATT5GGATCTCTTTAT 

I ST 2 ORF BAMHI CARTRIDGE > 

4100 

GGACGGTGTCC GTTTAGGAT GXAXATCCCAAACAAGCCAAGTAAGTAXGGAA3IXAAAAT CCrCAXGATGTGT GACAGTGGTACGAAGTA2IATGA!IAAA!rG 
I2T2 ORF BAMHI CARTRIDGE > 

4200 

SAATGC CTTATTTGGGAAGAGGAACACAGACX^^ 

IF?2 ORF BAMHI CARTRIDGE ; ; > 

430O 

TATTAC GTGTGACAAT T GGT T CACCTCAAT C C CTTTGGCAAAA^ACTT ACTACAAGAA£1C GTATAAGTTAAG CATT GT GGGAAC CGTGC GAT CAAACAAA. 
IFP2 ORF BAMHI CARTRIDGE ; > 

4400 

CGCGAjSATACCGGAAGTACTGAAAAACAOT 

_ IF?Z ORF BAMHI CARTRIDGE > 

4500 

AGC CAG<TT?U\GATGGTATAC TTAXTAI C_\X CTTGTGATGAGGAXGCT T CTAICAACQAAAGTACC &G r HAAAC C G CAAAX GGTTATGTATTATAAXCAAAC 
1FP2 ORF BAMHI CARTRIDGE > 

4 600 

XAAAGGCGGAGTGGACACGC TAGAC C_*AATGTGTTCTGTG_\I_ACCT GCAGTAGGA\AGACGAAIAGGTGGCCTAT GGCATTATTGTACGGAAT GATAAAC 
IFF 2 ORF BAMHI CARTRIDGE > 

470 0 

ATTC CC TGCATAAATT CTT T TATTATATAGL^JGCCAIAAT GT CAGT A GCAAGGGAGAAAAGG-TT CAAAGTC GCAAAAAAT TTAT GAGAAACC TTT ACAT GA 
" IFP2 ORF BAMET CARTRIDGE _> 

4300 

GOZTGACtSTCATCGTTTAIG CX£^^ GAAGTGCCTGG 
- IFF2 ORF BAMHI CARTRIDGE > 

4900 

TAC^\XCA<_ATt3ACACT^ CXAAAAXAAGGCGAAAGGCAlAAXGCA^ 
IF? 2 ORF BAMHI CARTRIDGE > 

5000 

AAAiTGCAA_2_\AAJGTT^T^ 

IFP2 ORF BAMHI CARTRIDGE > 

j 5100 
AAGd TAAXTAC TTAT T TTAT AATACAACAT GACT GTTTTTAAAGTACAAAATAAG-' TTATTTTTGTAAAAGAGAGAAT GTTTAJ\AAGTTTTGrTAi^TTTA 
IFF 2 ORF BAMHI CARTRIDGE > 

5200 

GAAGAA^OrTTGAGTT^TGTTT - 

IFF2 ORF BAMHI CARTRIDGE > 

S3O0 

GGT GGAGCTCCAGCTT TTGT TCCCTTTAGT GAGGGTTAATTGCGC GCTTGGCG—AATCATGGT CAT AGCT GTTT CCTGTGrGAAATTGT TATC CGCT CAC 

S400 

AATTC CACCACAAjCATAC GAG C CGGA^CAT AAAGT GT AAAGCCTGGGG^GC CTAAT GAGTGASC T AACTCACATTAAT TGCGTT Gd^CTZ CACT G C C CGC T 

5500 

T TCCAGT CGGG AAAC C TG TCGTGCCAGCTGCATTAATGAATCGGC CAACGCGCGGGGAGAGGCGGTTTGCGTAT TGGGC GCTCTTCCGCTTCCT C-3CTCA 

5S00. 

CTGACTCGCTGCGCTCGGTCGTTCGGCTGC GGCGAGC GGXATCAGCTCACTCAAAGGCGGTAATACGGTTATC CACAGAAICAGGGGATAAC GCAGGAAA 

----- • - - . • S - 0Q ; 

GAACAT GTGAGCAAAAGGC CAjSCAAAAGGC CAGG AAC C GX AAAAAGGCC GCGTTGCT GGCGTT T T T C CATAGGO TC C GC C CC C C TG AC GAGCAT CACAAA 



S30O 

AATCt^CGCTCAAGTCAGAGGTGGCGAAACCCGAC^ 

? 5900 

TGCCGCTTACCGGAIL^CCTGTCCGCCTTrCTC^ 

sooo 

CTCCAASCTGG^CTGTGTGCACG^CCCCC^^ 

blQO 

TCGCCACTG^SCAGCAGCCACTGGTAJ^^ 

6200 

GAAGGAiZAG7ATTTGGT?vZCTGC 

6300 

TG VJLT rTTT I GTTTGCAAGCAGCASAT^^ 

64 00 

AAjrrCA^PTTAAJGGGATTTTGCT^ 

6500 

ATGAGTIAAACTTGOTCTGACAGOri^ 

6600 

CGTGTAGAXAXCT?JCG?J!LACGGGAGG<3C^ 

.. 6700 . 

aaoo 

CGC C&iTT^\AT AGTTTGCGC^\ACGT T GTTGC eATTTGC TTiCAgCCAXC GOT GGTGTCACGCT CGTC GTTTGOX?a:GGCTTCAXTCAJGCTC C 6GTTC CCAAi: G 

6900 

ATC^AGG C GAGTTAiZATGATCC C CCAXGTT GTGCAAAAAAj^GOTTAGCTCCTTC GGTC CTC C GATCGXT GTCAGAAGT AAGTT GGC C GOAGT GTTATCA 

7000 

CTCAIGGTTATGGCAGCAjCgt^ZAXAA^ LOl GXGAAT 

7100 

AGTGT3JPGCGGCGACCG*g l ^ ^ ^ 

7200 

TTCGGGGCGA?JACrCTC^GGAICTT>JCCGCTGT^ 

7300 

AGCbl IT l-n3GGTGAGO\AAAACAGGAi>JSG<=^ 1 IIT CMT 

7400 

aaaagtgccac (SBQ ID NO: 67) 



Sequence Range: 1 to 10333 

100 

AAGCTTGGGCTGCAGGTCGACGGATCCAAATTCAACAAACAATTTATTTATGTTTAT 
TTATTTATTAAAAAAAAACAAAAACTCAAAATTTCTTCTAAAG 

200 

TAACAAAACTTTTAAACATTCTCTCTTTTACAAAAATAAACTTATTTTGTACTTTAAA 
A AC AGTC ATGTTGT ATT ATAAA AT A AGTA ATT A GCTT AACTT 

300 

ATACATAATAGAAACAAATTATACTT ATT AGTC AGTC AG AAACAACTTTGGCACATA 
TCAATATTATGCTCTCGACAAATAACTTTTTTGCATTTTTTGC 

< : '. : PIGGYBAC ORF 



400 

ACGATGCATTTGCCTTTCGCCTTATTTTAGAGGGGCAGTAAGTACAGTAAGTACGTT 
TTTTCATTACTGGCrCTTCAGTACTGTCATCTGATGTACCAGG 

< ■ PIGGYBAC ORF 



500 

CACTTCATTTGGCAAAATATTAGAGATATTATCGCGCAAATATCTCTTCAAAGTAGG 
AGCTTCTAAACGCTTACGCATAAACGATGACGTCAGGCTCATG 

< '. : PIGGYBAC ORF 



600 



TAAAGGTTTCTCATAAATTTTTTGCGACTTTGAACCTTTTCT^ 
ATGGCTGTATATAATAAAAGAATTTATGCAGGCAATGTTTA 
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• ) 



PIGGYBAC ORF 



700 



TCATTCCGTACAATAATGCCATAGGCCACCTATTCGTCTTCCTACTGCAGGTCA.TCAC 
AGAACACATTTGGTCTAGCGTGTCCACTCCGCCTTTAGTTTG 

<- — PIGGYBAC ORF 



800 

ATTATAATACATAACCATTTGCGGTTTACCGGTACTTTCGTTGATAGAAGCATCCTCA 
TCAGAAGATGATAATAAGTATACCATCTTAGCTGGCTTCGGT 

K PIGGYBAC ORF 



900 

TTATATGAGACGAGAGTAAGGGGTCCGTCAAAACAAAACATCGATGTTCCCACTGG 
CCTGGAGCGACTGTTTTTCAGTACTTCCGGTATCTCGCGTTTGT 

< , PIGGYBAC ORF 



1000 

TTGATCGCACGGTTCCCACAATGGTTAACTTATACGGTTCrTGTAGTAAGTTTTTTGC 
CAAAGGGATTGAGGTGAACCAATTGTCACACGTAATATTACG 

. < . PIGGYBAC ORF 



1100 



ACAACTACCGTGCACAGGCTTTGATAACTCCTTCACGTAGTATTCACCGAGTGGTAC 
TCCGTTGGTCTGTGTTCCTCTTCCCAAATAAGGCATrCCATTT 



.PIGGYBAC ORF 
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1200 



ATCATATACTTCGTACCACTGTCACACATCATGAGGATTTTTATTCCATACTTACTTG 
GCTTGTTTGGGATATACATCCTAAACGGACACCGTCCTCTAA ' 

K - PIGGYBAC ORF 



1300 

AACCAAGTAACTGTTCATCTATGGTCAAATGAGCCCCTGGAGTGTAATTTTGTATGC 
ACTGATGGATAAAGAGATCCCATATTTTTCTAACAGGAGTAAA 

<— : . PIGGYBAC ORF 



1400 



TACATCGTTTTCTCGAAGTGTGGGCCGTATACTTTTGTCATCCATTCTAAGACATCGT 
ATCAAAAAATCAAAACGATCACGACTCATTACAGAGACGTAC 

< : ___PIGGYBAC ORF 



1500 



ACCATTGACAAAGATCGATCAAAGAGGTCATCTGTGGACATGTGGTTATCTTTTCl'C 
ACTGCTGTCATTACCAGAATACCAAAGAAAGCATAGATTTCAT 

< — PIGGYBAC ORF 



. . 1600 

CTTCATTCGTGTCACGAAATGTAGCACCTGTCATAGATTCCCGACGTTTCAATGATAT 
CTCAGCATTTGTCCATTTTACAATTTCCGAAATTATCTCATC 

<- . PIGGYBAC ORF 



1700 

AGTAAAAAATAGTTTGAAGCATAAAAGTGGGTCATATATATTGCGGCACATACGCG 
TCGGACCTCTTTGAGATCTGACAATGTTCAGTGCAGAGACTCGG 
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_PIGGYBAC ORF 



1800 

CTACGCCTCGTGGACTTTGAAGTTGACCAACAATGTTTATTCTTACCTCTAATAGTCC 
TCTGTGGCAAGGTCAAGATTCTGTTAGAAGCCAATGAAGAAC 

< PIGGYBAC ORP 



1900 

CTGGTTGTTCAATAACATTTTGTTCGTCTAATATTrCACTACCGCTTGACGTTGGCTG 
CACTTCATGTACCTCATCTATAAACGCTTCTTCTGTATCGCT 

< PIGGYBAC ORF 



2000 

CTGGACGTCATCTTCACTTACGTGATCTGATATTTCACTGTCAGAATCCTCACCAACA 
AGCTCGTCATCGCTTTGCAGAAGAGCAGAGAGGATATGCTCA 

< ; PIGGYBAC ORF 



2100 

TCGTCTAAAGAACTACCCATTTTATTATATAGGATCCCCGACACCAGACCAACTGGT 
AATGGTAGCGACCGGCGCTCAGCTGGAATTAGGCCTTCTAGAC 
< PIGGYBAC ORF . 

2200 

CGCGGCCGCAGATCrGTTAACGAATTCCCAATTCCCTATTCAGAGTTCTCTTCTTGTA 
TTCAATAATTACTTCTTGGCAGATTTCAGTAGTTGCAGTTGA 

<_ "HSP 70 

PROMOTER 

2300 
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TTTACTTGGTTGCTGGTTACTTTTAATTGATTCACTTTAACTTGCACTTTACTGCAGAT 
TGTTTAGCTTGTTCAGCTGCGCTTGTTTATTTGCTTAGCTT 



< 



PROMOTER 



_HSP 70 



2400 



TCGCTTAGCGACGTGTTCACTTTGCTTGTTTGAATTGAATTGTCGCTCCGTAGACGAA 
GCGCTCTATTTATACTCCGGCGCrCTTTTCGCGAACATTCGA " 



< 



PROMOTER. 



_HSP 70 



2500 



GGCGCGCTCTCTCGAACCAACGAGAGCAGTATGCCGTTTACTGTGTGACAGAGTGA 
GAGAGCATTAGTGCAGAGAGGGAGACCCAAAAAGAAAAGAGAGA 



< 



_HSP 70 



PROMOTER ■ 

2600 

ATAACGAATAACGGCCAGAGAAATTTCTCGAGTTTTCTTCTGCCAAACAAATGACCT 
ACCACAATAACCAGTTTGTTTTGGGATTCTAGGGGGATCGGGG 



< 



PROMOTER 



_HSP 70 



2700 



ATCAATTCTAGTATGTATGTAAGTTAATAAAACCCTTTTTTGGAGAATGTAGATTTAA 
AAAAACATATTTTTTTTTTATTTTTTACTGCACTGGATATCA 



2800 



TTGAACTTATCTGATCAGTTTTAAATTTACTTCGATCCAAGGGTATTTGAAGTACCAG 
GTTCTTTCGATTACCTCTCACTCAAAATGACATTCCACTCAA 
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2900 



AGTCAGCGCTGTTTGCCTCCTTCTCTGTCCACAGAAATATCGCCGTCTCTTTCGCCGC 
TGCGTCCGCTATCrCTTTCGCCACCGTTTGTAGCGTTACCTA 



3000 

GCGTCAATGTCCGCCTTCAGTTGCACTTTGTCAGCGGTTTCGTGACGAAGCTCCAAG 
CGGTTTACGCCATCAATTAAACACAAAGTGCTGTGCCAAAACT 

3100 

CCTCTCGCITCTTATTTTTGTrrGTTTTTT 

TGGGTAAGCAGGGGAAAGTGTGAAAAATCCCGGCAATGGGC 

3200 



CAAGAGGATCAGGAGCTATTAATTCGCGGAGGCAGCAAACACCCATCTGCCGAGCA 
TCTGAACAATGTGAGTAGTACATGTGCATACATCTTAAGTTCAC 

3300 

TTGATCTATAGGAACTGCGATTGCAACATCAAATTGTCTGCGGCGTGAGAAGTGCGA 
CCCACAAAAATCCCAAACCGCAATCGCACAAACAAATAGTGAC 

3400 

ACGAAACAGATTATTCTGGTAGCTGTGCTCGCrATATAAGACAATTTTTAAGATCAT 
ATCATGATCAAGACATCTAAAGGCATTCATTTTCGACTACATT 

3500 



CTITTTTACAAAAAATATAACAACCAGATATTTTAAGCTGATCCTAGATGCACAAAA 
AATAAATAAAAGTATAAACCTACTTCGTAGGATACTTCGTTTT 

3600 

GTTCGGGGTTAGATGAGCATAACGCTTGTAGTTGATATTTGAGATCCCCTATCATTG 
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CAGGGTGACAGCGGAGCGGCTTCGCAGAGCTGCATTAACCAGG 

3700 

GCTTCGGGCAGGCCAAAAACTACGGCACGCTCCTGCCACCCAGTCCGCCGGAGGAC 
TCCGGTTCAGGGAGCGGCCAACTAGCCGAGAACCTCACCTATGC 

3800 

CTGGCACAATATGGACATCTTTGGGGCGGTCAATCAGCCGGGCTCCGGATGGCGGC 
AGCTGGTCAACCGGACACGCGGACTATTCTGCAACGAGCGACAC 

3900 

ATACCGGCGCCCAGGAAACATTTGCTCAAGAACGGTGAGTTTCTATTCGCAGTCGGC 
TGATCTGTGTGAAATCTTAATAAAGGGTCCAATTACCAATTTG 

4000 

AAACTCAGTTTGCGGCGTGGCCTATCCGGGCGAACTTTTGGCCGTGATGGGCAGTTC 
CGGTGCCGGAAAGACGACCCTGCTGAATGCCCTTGCCTTTCGA 

4100 

TCGCCGCAGGGCATCCAAGTATCGCCATCCGGGATGCGACTGCTCAATGGCCAACCT 
GTGGACGCCAAGGAGATGCAGGCCAGGTGCGCCTATGTCCAGC 

4200 

agGatgacctctttatcggctccctaacggccagggaacacctgattttccaggcca 
tggtgcggatgccacgacatctgacgtatcggcagcgagtggc 

' 4300 

CCGCGTGGATCAGGTGATCCAGGAGCTTTCGCTCAGCAAATGTCAGCACACGATCAT 
CGGTGTGCCCGGCAGGGTGAAAGGTCTGTCCGGCGGAGAAAGG 

4400 
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AAGCGTCTGGCATTCGCCTCCGAGGCACTAACCGATCCGCCGCTTCTGATCTGCGAT 
GAGCCCACCTCCGGACTGGACTCATTTAGCGCCCACAGCGTCG 

4500 

TCCAGGTGCTGAAGAAGCTGTCGCAGAAGGGCAAGACCGTCATCCTGACCATTCAT 
CAGCCGTCTTCCGAGCTGTTTGAGCTCTTTGACAAGATCCTTCT 

4600 

GATGGCCGAGGGCAGGGTAGCTtTCTTGGGCACTCCCAGCGAAGCCGTCGACTTCTT 
TTCCTAGTGAGTTCGATGTGTTTATTAAGGGTATCTAGCATTA 

4700 

CATTACATCTCAACTCCTATCCAGCGTGGGTGCCCAGTGTCCTACCAACTACAATCC 
GGCGGACTTTTACGTACAGGTGTTGGCCGTTGTGCCCGGACGG 

4800 

GAGATCGAGTCCCGTGATCGGATCGCCAAGATATGCGACAATTTTGCTATTAGCAAA 
GTAGCCCGGGATATGGAGCAGTTGTTGGCCACCAAAAATTTGG 

4900 

AGAAGCCACTGGAGCAGCCGGAGAATGGGTACACCTACAAGGCCACCTGGTTCATG 
CAGTTCCGGGCGGTCCTGTGGCGATCCTGGCTGTCGGTGCTCAA 

5000 

GGAACCACTCCTCGTAAAAGTGCGACTTATTCAGACAACGGTGAGTGGTTCCAGTGG 
AAACAAATGATATAACGCTTACAATTCTTGGAAACAAATTCGC 

5100 

TAGATTTTAGTTAGAATTGCCTGATTCCACACCCITCTTAG'rrri'l'rrCAATGAGATG 
TATAGTTTATAGTTTTGCAGAAAATAAATAAATTTCATTTAA 



FIG. 22 



5200 



CTCGCGAACATGTTGAAGATATGAATATTAATGAGATGCGAGTAACATTTTAATTTG 
CAGATGGTTGCCATCTTGATTGGCCTCATCTTTTTGGGCCAAC 

5300 

AACTCACGCAAGTGGGCGTGATGAATATCAACGGAGCCATCTTCCTCTTCCTGACCA 
ACATGACCTTTCAAAACGTCTTTGCCACGATAAATGTAAGTCT 

5400 

TGTTTAGAATACATTTGCATATTAATAATTTACTAACTTTCTAATGAATCGATTCGAT 
TTAGGTGTTCACCTCAGAGCTGCCAGTTTTTATGAGGGAGGC 

5500 

CCGAAGTCGACITTATCGCTGTGACACATACTTTCrGGGCAAAACGATTGCCGAATT 
ACCGCTTTTTCTCACAGTGCCACTGGTCTTCACGGCGATTGCC 

5600 

TATCCGATGATCGGACTGCGGGCCGGAGTGCTGCACTTCTTCAACTGCCTGGCGCTG 
GTCACTCTGGTGGCCAATGTGTCAACGTCCTTCGGATATCTAA 

5700 

TATCCTGCGCCAGCTCCTCGACCTCGATGGCGCTGTCTGTGGGTCCGCCGGTTATCAT 
ACCATTCCTGCTCTTTGGCGGCTTCTTCTTGAACTCGGGCTC 

5800 

GGTGCCAGTATACCTCAAATGGTTGTCGTACCTCTCATGGTTCCGTTACGCCAACGA 
GGGTCTGCTGATTAACCAATGGGCGGACGTGGAGCCGGGCGAA 

5900 , 

ATTAGCTGCACATCGTCGAACACCACGTGCCCCAGTTCGGGCAAGGTCATCCTGGA 
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GACGCTTAACTTCTCCGCCGCCGATCTGCCGCTGGACTACGTGG 



6000 

GTCTGGCCATTCTCATCGTGAGCTTCCGGGTGCTCGCATATCTGGCTCTAAGACTTCG 
GGCCCGACGCAAGGAGTAGCCGACATATATCCGAAATAACTG 

6100 

CTTGTTTTTTTTTTTACCATTATTACCATCGTGTTTA 
GCTAATGTAATTATATTTGTGCCAATAAAAACAAGATATGA 

6200 

CCTATAGAATACAAGTATTTCCCCTrCGAACATCCCCACAAGTAGACTTTGGATTTG 
TCTTCTAACCAAAAGACTTACACACCTGCATACCTTACATCAA 

6300 

AAACTCGTTTATCGCTACATAAAACACCGGGATATATTTTTTATATACATACTTTTCA 
AATCGCGCGCCCTCTTCATAATTCACCTCCACCACACCACGT 

6400 

TTCGTAGTTGCTCTTTCGCTGTCTCCCACCCGCTCTCCGCAACACATTCACCTTTTGTT 
CGACGACCTTGGAGCGACTGTCGTTAGTTCCGCGCGATTCG 

.6500 

GTTCGCTCAAATGGTTCCGAGTGGTTCATTTCGTCTCAATAGAAATTAGTAATAAAT 
ATTTGTATGTACAATTTATTTGCTCCAATATATTTGTATATAT 

* 6600 

TTCCCTCACAGCTATATTTATTCTAATTTAATATTATGACTTTTTAAGGTAATT^ 
TGACCTGTTCGGAGTGATTAGCGTTACAATTTGAACTGAAA 

6700 
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GTGACATCCAGTGTTTGTTCCTTGTGTAGATGCATCTCAAAAAAATGGTGGGCATAA 
TAGTGTTGTTTATATATATCAAAAATAAGAACTATAATAATAA 



6800 

GAATACATTTAATTTAGAAAATGCTrGGATTTCACTGGAACTAGAATTAATTCGGCT 
GCTGCTCTAAACGACGCATTTCGTACTCCAAAGTACGAATTTT 



6900 

ITCCCTCAAGCTCITATTTTCATTAAACAATGAACAG 
TATTGTTTACATAAATGATTTTITTrACTATrcAAA 

7000 

TGTTTGTGTACTCCCACTGGTATAGCCTTCTTTTATCTTTTCT 
CTTTACTAGGTACGGCATCTGCGTTGAGTCGCCTCCTTTTA 



7100 

AATGTCTGAGCTITTGCAGGTGCAGCCrrTCCACrGCGAATCTTTAAAGTGGGTATCA 
CAAATTTGGGAGTTTTCACCAAGGCTGCACCCAAGGCTCTGCT 

7200 

CCCACAATTTTCTCTTAATAGCACACTTCGGCACGTGAATTAATTTTACTCCAGTCA^ 
AGCITTGCAGCAAAATTTGCAATATTTCATTTTTTTTTATTC 



7300 

CACGTAAGGGTTAATGTTTTCAAAAAAAAATTCGTCCGCACACAACCTTTCCTCTCA 
ACAAGCAAACGTGCACTGAATTTAAGTGTATACTTCGGTAAGC 

7400 

TTCGGCTATCGACGGGACCACCTTATGTTATTTCATCATGGGCCAGACCCACGTAGT 
CCAGCGGCAGATCGGCGGCGGAGAAGTTAAGCGTCTCCAGGAT 
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7500 

GACCTTGCCCGAACTGGGGCACGTGGTGTTCGACGATGTGCAGCTAATTTCGCCCGG 
CTCCACGTCCGCCCATTGGTTAATCAGCAGACCCTCGTTGGCG 

7600 

TAACGGAACCATGAGAGGTACGACAACCATTTGAGGTATACTGGCACCGAGCCCGA 
GTTCAAGAAGAAGGCGTTTTTCCATAGGCTCCGCCCCCCTGACG 

7700 

AGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAA 
AGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCC 

7800 

TGTTCGGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTG 
GCGCTTTCTCAATGCTCACGCTGTAGGTATCTCAGTTCGGTG 

7900 

TAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGC 
TGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAA 

8000 

GACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGG 
TATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTA 

8100 

CGGCrACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTT 
CGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACC 

8200 

GCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCAGATTACGCGCAGAAAAAAAGG 
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ATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTC 

8300 

AGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATC 
TTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATGAAT 

8400 

CTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGC 
ACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCC 



8500 . 

TGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGT 
GCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATT 

8600 

TATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACT 
TTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAG 



8700 

AGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGCCATTGCTACAGGCAT 
CGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCC 



8800 

GGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTT 
AGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCG 



- 8900 

CAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATC 
CGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTC 



9000 
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) 



) 



ATTCTGAGAATAGTGTATGCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGA 
TAATACCGCGCCACATAGCAGAACTTTAAAAGTGCTCATCATT 

9100 

GGAAAACGTTCTTCGGGGCGAAAACTGTCAAGGATCTTACCGCTGTTGAGATCCAGT 
TCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTT 

9200 

TTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAA 
AAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTT 

9300 

CCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATA 
TTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGC 

. 9400 

ACATTTCCCCGAAAAGTGCCACCTGACGTCTAAGAAACCATTATTATCATGACATTA 
ACCTATAAAAATAGGCGTATCACGAGGCCCTTTCGTCTCGCGC 

9500 

GTTTCGGTGATGACGGTGAAAACCTCTGACACATGCAGCTCCCGGAGACGGTCACA 
GCTTGTCTGTAAGCGGATGCCGGGAGCAGACAAGCCCGTCAGGG 

9600 

CGCGTCAGCGGGTGTTGGCGGGTGTCGGGGCTGGCTTAACTATGCGGCATCAGAGC 
AGATTGTACTGAGAGTGCACCATATGCGGTGTGAAATACCGCAC 

9700 

CGAATCGCGCGGAACTAACGACAGTCGCTCCAAGGTCGTCGAACAAAAGGTGAATG 
TGTTGCGGAGAGCGGGTGGGAGACAGCGAAAGAGCAACTACGAA 
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9800 



ACGTGGTGTGGTGGAGGTGAATTATGAA.GAGGGCGCGCGATTTGAAAA.GTATGTAT 
ATAAAAAATATATCCCGGTGTTTTATGTAGCGATAAACGAGTTT 



9900 



TTGATGTAAGGTATGCAGGTGTGTAAGTCTTTTGGTTAGAAGACAAATCCAAAGTCT 
ACTTGTGGGGATGTTCGAAGGGGAAATACTTGTATTCTATAGG 



10000 



TCATATCTTGTTTTTATTGGCACAAATATAATTACATTAGCTTTTTGAGGGGGCAATA 
AACAGTAAACACGATGGTAATAATGGTAAAAAAAAAAACAAG 



10100 



CAGTTATTTCGGATATATGTCGGCTACTCCTTGCGTCGGGCCCGAAGTCTTAGAGCC 
AGATATGCGAGCACCCGGAAGCTCACGATGAGAATGGCCAGAC 



10200 



CATGATGAAATAACATAAGGTGGTCCCGTCGGCAAGAGACATCCACTTAACGTATG 
CTTGCAATAAGTGCGAGTGAAAGGAATAGTATTCTGAGTGTCGT 



10300 



ATTGAGTCTGAGTGAGACAGCGATATGATTGTTGATTAACCCTTAGCATGTCCGTGG 
GGTTTGAATTAACTCATAATATTAATTAGACGAAATTATTTTT 

AAAGTTTTATTTTTAATAATTTGCGAGTACGCA ( SEQ ID NO: 68) 
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Natural piggyBac orf 1 ATGGGTAGTT CTTTAGACGA TGAGCATATC CTCTCTGCTC TTCTGCAAAG 

Optimized piggyBac orf \ ATGGGTAGca gccTgGAtGA TGAaCATATC CTgagcGCgC TgCTGCAgAG 

- Natural piggyBac orf 51 CGATGACGAG CTTGTT6GTG AGGATTCTGA CAQTC5AAATA TCAGATCACG 

Optimized piggyBac orf 51 CGAcGAcGAa CTgGTTGGTG AaGATagcGA cAGcGAAATc agfcGATCACG 

Natural piggyBac orf 101 TAAGTGAAGA TGACGTCCAG AGCGATACAG AAGAAGCGTT TATAGATGAG 

Optimized piggyBac orf 101 TgAGcGAAGA cGAcGTlCAG AGCGATACcG AAGAAGCGTT cATcGAcGAa 

Natural piggyBac orf 151. GTACATQAAG TGCAGCCAAC GTCAAGCGGT AGTGAAATAT TAGACGAACA 

Optimized piggyBac orf 151 GTlCAcGAAG TGCAGCCgAC cagcAGCGGT AGcGAAATcc Tg&AtGAACA 

Natural piggyBac orf 201 AAATGTTATT GAACAACCAG GTTCTTCATT GGCTTCTAAC A6AATCTTGA 

Optimized piggyBac orf 201 gAAcGTTA Tc GAACAgCCgG GTagcagccT GGCflagcAAC cGtATCcTGA 

Natural piggyBac orf 251 CCTTGCCACA GAGGACTATT AGAGGTAAGA ATAAACATTG TTGGTCAACT 

Optimized piggyBac orf 251 CCcTGCCgCA GcGcACcATc cGt GGTAA a A AcAAACAcTG TTGGagcACc 

Natural piggyBac orf 301 TCAAAGTCCA CGA6GCGTA6 CCGAGTCTCT GCACTQAACA TTGTCAGATC 

Optimized piggyBac orf 301 agcAAaagCA CccGcCGTAG CCGtGTt.agc GCgCTGAACA TTGTIcGtag 

Natural piggyBac orf 351 TCAAAGA GGT CCGACGCGTA TGTGCCGCAA TATATATGaC CCACTTTTAT 

Optimized piggyBac orf 351 cCAgcGtGGT CCGA.CcCGTA TGTGCCGCAA cATcTAcGAt CCgCTgcTgT 

Natural piggyBac orf 401 GCTTCAAACT ATTTTTTACT GATGAGATAA TTTCGGA AA T TGTAAAATGG 

Optimized piggyBac orf 401 GCTTCAAACT gTTcTTcACc GATGAaATcA Tc age GA AA T cGTgAAATGG 

Natural piggyBac orf 451 ACAAA TGCTG AGATATCATT GAAACGTCGG GAATCTATGA CAGGTGCTaC 

Optimized piggyBac arf 4-51 ACcAAcGCcG AaATcagccT GAAACGTCGc GAAagcATGA CcGGcGCgAC 

Natural piggyBac orf 501 ATTTCGTGAC ACGAATGAAG ATGAAATCTA TGCTTTCTTT GGTATTCTGG 

Optimized piggyBac orf 501 cTTcCGcGAt AC cAAcGAaG ATGAaATCTA cGCcTTCTTc GGTATcCTGG 

Natural piggyBac orf 551 TAATGACAGC AGTGA GA AAA GATAACCACA TGTCCACAGA TGACCTCTTT 

Optimized piggyBac orf 551 TgATGACcGC gGTGcGl AAA GATAACCACA TGagCAC c GA TGAtCTgTTT 

Natural piggyBac orf 601 GA TC GA TC TT TGTCAATGGT GTACGTC TCT GTAATGAGTC GTGATCGTTT 

Optimized piggyBac orf 601 GA TCG t a g c c TGagcATGGT tTACGTtagc GTtATGAGcC GtGAcCGTTT 

Natural piggyBac orf 651 TGA TTTTTTG A TACGA TGTC TTAGAATGGA TGACAAAAGT ATACGGCCCA 

Optimized piggyBac orf 651 cGAT.TTTcTG ATcCGtTGTC TgcGlATGGA TGAtAAAAGc ATcCGcCCgA 

Naiurai piggyBac orf 701 CACTTCGAGA AAACGATGTA TTTACTCCTG TTAGAAAAAT ATGGGATCTC 

Optimized piggyBac orf 701 CcCTgCGcGA AAACGA TGTg TTcACcCCgG TTcGcAAAAT cTGGGATCTg 



Conk 



ssasss* ssssas sssss: esses: sskks j-zsksss 
zsasss* s\ iKir-ss ssiiss st^i sksssk cnsssa 

MxuralpiggyBacoTf 851 TCCCAAACAA GCCAAGTAAG TATGGAATAA AAaTCCTCat „ 
851 TCCCgAACAA aCCgAGc AA a TAcGGtAT^A A^ATC^^ G^T^Gt1a = 

hTatural piggyBaa arf SOI AGTGGTACGA AGTATATGAT AAATGGAATG CCTTATTTrr a/-»^,~. 

O prtnaBdpigsyBacorf m AGcGGTACcA A g TA c A TGA T c A A c GG^ A TG CC^A^TGG gT^GGTAC 

3»S5^JSSf cggtagctgt C^AaIaT^ q CTGTGA c a A IZ^Z 

5=SaS^ JS I^ATCCC^ TGGCgAAAAA CcT^CTgCAg ' GAACCGTATA A^AO^T 

S3335K,2£ essss is™: sxssss 

ssa^^^s^r. assess 

SMS^^S^^ ACCGAAaCCG T^It'It 

s=sa^sjis:s^sss:s:3 sis:™ asssiiss ssssss 

SSSi35SS2W X301 1 Aat^G^AT i^ 7 Ti T ir ^ ACTWAG 3CGGAGTGGA CACGC TAGAC 
y p ggyjjac orf 1301 AgATGGTTAT GTAcTAcAAc CAgACcAAAG GCGG t GTGGA cACcCTgGA T 

o7Z£Z S * B %° r/ - ^ CAAATGTGTT CTGTGA TGAC CTGCAGTAGG AAGACGAATA GGTGGCCTAT 
O ptomzaipxssyBacorJ CAgA TGTGc a gcQTtATGAC CTGCAGccGo AAaACcAAcc GcTGGCC gAT 

Neurol piggySncorf 1401 GGCATTATTG TACGGAATGA TAAACATTGC CTGCATAAAT TCTTTTaTTa 
Oprimzed piggy Bac orf 1401 GGCgcTgcTG TACGGtATGA TcAACAToGC CTGCATcaaI ISIttTaJcA 

Neural P l ?S y 3 ac or/ 1451 TATACAGCCA TAA TGTCAGT AGCAAGGGAG AAAAQGTTCA^AAGTrrrA.A 
Optrmizea. piggyBac tyrf 1451 TcTACAGCCA TAA cGT t AGc AGCAAaGGtG AAAAaGTTCA gAGcCGCAAA 



assttsa, sk aasssa: sassre siaasa := c ?n^ zxssz 

MxuralpiggyBacorf 1601 ATA TTTTGCC AAATGAAGTG CCTGGTACAT CAGA TGACAG TArTr«^,„ 
^rtSto,./ .«> AcATecTGCC gAA CGAAGTG CC 3 GGTActI gcGATGA^AG cAC cG^A A G A ^ 

"SSaSES^ JS CCgGTgA TGA AAAAACGTAC I™*! ?™gX jeAAAA Tc * G 

S=ass52w;s; jesss; asssre g^srs? iss IS s s; ™i 



FIG. 24 



pCaSpeR-hs-orf 



BamU I 



BamU I 



piggyBac ORF fa/? 70 mini-white p-TR AmpVori p-TR 



B 



p(PZ)-Bac-EYFP 



(Not MHind III) 



LacZ 



(Hind \U/Sal 1) (Jfta I/Spe I) J$pe MXba I) 



hsp70 KanVori 3xP3-EYFP 3TR 5TR 



C 



pBSII-ITRLlk-ECFP 



Apal Xhol 



EcoRl 



J- 



fl origin 5'TR spacer 3'TR 3xP3-ECFP ori 



Amp' 



11 pXL-BacII-ECFP 



EcoRl Spel Bglll Xhol Apal 



fl origin 3'TR 3xP3-ECFP 5'TR ori Amp r 



pIAO-P/L-589bp 



— T 



FIG. 25 

R4 R3 R2 Rl 5' TR Stuffer 3* TR LI L2 L3 L4 L5 



p(PZ)-Bac-EYFP 

pBSII-ECFP-Rl/L5 

pBSII-ECFP-R2/L5 

pBSII-ECFP-R3/L5 

pBSII-ECFP-R4/L5 

pBSII-ECFP-R4/L4 

pBSII-ECFP-R4/L3 

pBSII-ECFP-R4/L2 

pBSII-ECFP-R4/Ll 

pBSII -ITR1.1K-ECFP 

pBSII-ECFP-R-TR/L 

pBSII-ECFP-Rl/L 

pBS-pBac/DsRed 

P Bac{3xP3-ECFPafm} 



R R-TR 



L 



35bp 



63bp 



01 bp (66+35) 
(241+35) 



1018bp (63+955) 



CZZI 
276bp 



1018bp (63+955) 



419bp (384+35) 



1018bp (63+955) 



567bp (542+35) 



101 8bp (63+955) 



567bp (542+35) 



882bp (63+819) 



567bp (542+35) 



J 



649bp (63+586) 



567bp (542+35) 



441bp (63+378) 



567bp (542+35) 188bp (63+125) 



I I 



3 11 bp (276+35) 23 5bp (63+172) 

35bp 235bp (63+172) 
101 bp (66+35) 

37bp (2+35) 



235bp (63+172) 



674bp (639+35) 



99bp (63+36) 



Transformation 
Frequency 

0.6% 



8.9% 



12.5% 



15.0% 



12.9% 



10.7% 



9.7% 



10.8% 



0% 



13.9% 



0% 



0% 



0% 



12.9% 



1050bp (63+987) 



FIG. 26 



115bp 



240bp 



Rl 



MF34 



5'TR 

— c=c 




MF34 



Rl 



945bp 



L 

* I 3 T TR 



3xP3-ECFP 



B 



I 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 



a: PCR using IFP2_R1 +MF34 
b: PCR using IFP2 L + MF34 
c: PCR using IFP2 R1 + IFP2JL 
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